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The present invention relates to a novel DNA fragment encoding non-A non-B hepatitis-specific antigen 
polypeptide which is found at the time of infection or onset of the non-A non-B hepatitis. 

It also relates to an expression vector containing said DNA fragment to a transformant transformed with 
said expression vector and to an expressed polypeptide obtained by culturing said transfonmant 
5 It further relates to a single strand DNA sequence for PGR primer synthesized on the basis of a partial base 

sequence of said DNA fragnr>ent 

It also relates to the use of said expressed polypeptide and said single strand DNA sequence in detection 
of the non-A non-B hepatitis vims. 

10 BACKGROUND OF THE INVENTION 

Non-A non-B hepatitis is an infectious disease which is caused by a masked virus other than A and B hepati- 
tis viruses, but it is not easy to identify the virus because an arrKHjnt of the virus-specific antigens is very small 
in a patient's body as well as of anti-virus antibodies. Accordingly, diagnosis of non-A non-B hepatitis has been 

15 made serologically by the well-known "diagnosis by exclusion" method wherein increase in the levels of alanine 
aminotransferase and aspartate aminotransferase is determined for a serum from a patient to make a diagnosis 
whether or not the hepatitis belongs to any of hepatitis A, hepatitis B, hepatitis D and other hepatitis symptoms 
due to the known hepatopathy-causing viruses such as CMV, EBV, etc, and if the result of diagnosis are not 
applicable to them, then a case is identified as non-A non-B hepatitis. K however, is difficult to diagnose dini- 

20 cally as being non-A non-B hepatitis by such a method because there is no cwrelatton between ALT value and 
non-A non-B hepatitis. Also, the lack of trustworthy means for the diagnosis is a serious problem, whereby a 
secondary infection with the non-A non-B hepatitis virus which may be caused by transfusing blood, especially, 
from a non-A non-B hepatitis virus-carrying healthy carrier into a person can hardly be prevented. Therefore, 
it is considered that the non-A non-B hepatitis occupies more than 90% of hepatitis cases caused by blood 

25 transfiiston, with a total of at>out one mil ton patients per one year. 

In order to improve such situation and to raise a diagnostic accuracy of non-A non-B hepatitis. Altar's panel 
in which a standard serum is used has been developed by Alter at al at the NIH. Diagnostk: materials which 
can pass the Altar's panel have been obtained by Arima ef a/[JIKKEN IGAKU (Japan), 7 (2). 1 96 - 201 (1989)] 
and by M. Houghton et al (WO 89/04669, PCT/JP90/500880) of Chiron Corp. almost simultaneously. Arima ef 

30 a! have screened the sera from hepatitis patients using Xgtll (a protein expresston vector) which is derived 
from viral RNA finom a non-A non-B hepatitis patient's serum. Also, Chiron Corp. have inoculated the patienrs 
blood plasma into a chimpanzee to develop a chronic hepatitis, blood plasma being obtained from the diseased 
animal which possesses the anti-virus antibodies with high titer, and then have screened in the same way as 
Arima etaL Chiron Corp/s group has also succeed in doning almost the whole portton of the gene of a hepatitis 

35 C virxis (HCV, designated by Chiron Corp.) and devetoped a kit for diagnosis whteh comprises aaantigen protein 
obtained by expressing a part of the HCV gene. 

In spite of such an efVbrt however, fectors of this disease, even their numbers, have not yet been elucklated 
to the full. 

As described above, the two materials which can pass the Alter'a panel has certainly lead to a new tech- 
no nlque of diagnosis replaced by said "diagnosis by excluston", but screening patienf a sera separately with the 
materials gives rK> resulta to be satisfied because both the materials from Arima ef a/ and Chiron Corp. react 
with patient's sera fai tow positive ratk>s of about 60 to 80% 6n6 about 50 to 70%, respectively. In other words, 
in some cases, these materiais would not react with sera from the patients who have been diagnosed d inlcally 
as norvA nor>-B hepatitis. A virus commonly have a functton to cause mutatton in their host cells for tfieir surval. 
45 and thus the viral genes Isolated from American patients by Chiron Corp. had been possibly mutated into vari- 
ous forms aodbnated to the chimpanzee as an Infisction tntermediato. 

Accordingly, a great demand has been d^ected to a large scale preparatton of the reactive antigens which 
are capable of probing the nOrvA non-6 hepatttis patients or carriers, therafior it wm be necessary to construct 
effective cDNA dones through the isolation and purificatton of variously mutated vfrai RNA from many non-A 
50 non-6 hepatitis patients. 

In addition. In the case of sera which have feled in a trustworthy diagnosis using an anta>ody detectton 
system, or of sera whk:h ara cdtected Inmediately after infectton and in which antibody titers do not yet raise, 
a gene amplification method (PCR method) may be useful for the conflnmatton of the disease because it can 
detect a trace anrKXint of viral genes. Also, it is possible to done the genes efftoientiy by the PCR method. How- 
55 ever, since the PCR method is carried out using primers whteh are synthesized from a known gene sequence, 
it is not always possible to detect a gene of the non-A non-6 hepatitis virus in a patienf s flukj using a prirner(s) 
which can be constructed on the basis of the HCV gene sequences determined by Chiron Corp., if a difference 
in mutatton between sakj HCV gene of Chiron Corp. and sakj patient-carried viral gene is signifkmnt 
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In consequence, to detect efficientJy infection wrth the non-A non-B hepatitis vims, it is necessary to prepare 
at least one printer capable of detecting the viral gene with a high specificity. Such a purpose may t>e acconrv 
pltshed by isolating a great number of cDNA clones, synthesizing primers from relatively preserved regions 
among their gene sequences, and subjecting the primers obtained to screening through the PGR method. 

5 

SUMMARY OF THE INVENTION 

This invention provides a novel ONA fragment which encodes a non-A non-B hepatitis-specific antigen 
polypeptide originated from a non-structural or structural protein of the non-A non-B hepatitis virus, the potypep- 
10 tide being formed at the time of the infection or onset of the non^ non-B hepatitis. 

This invention also provides an expression vector containing the DNA fragment, a transfbnmant transfbr- 
med with the expression vector, an expressed polypeptide obtained by culturing the transformant, and a pro- 
cess for its production. 

This invention further provides a primer for use in the detection of non-A non-B type hepatitis virus genes. 
15 This invention further yet provides use of the expressed polypeptide or single strand DNA primer in detec- 
tion of the non^ non-B hepatitis virus, and a method for the detection of non-A non-B type hepatitis virus genes 
and anti-non-A non-B type hepatitis vinjs antibodies. 

BRIEF DESCRIPTION OF THE DRAWINGS 

20 

Fig. 1 shows a nucleotide sequence of norvA non-B hepatitis-specific cDNA which is encoded in a done 
011-7 and an amino acid sequence deduced from the sequence. These sequences are equivalent to SEQ ID 
NO. 1. 

Fig. 2 shows a nucleotide sequence of non-A non-B hepatitis spectfkM:DNA which is encoded in a done 
25 010-11 and an amino acid sequence deduced from the sequence. These sequences are equvalent to SF.Q ID 
NO.Z 

Fig. 3 shows a nudeotide sequence of non-A non-B hepatitis specific-cDNA which is encoded in a done 
010-13 and an amino add sequence deduced from the sequence. These sequences are equvalent to SEQ ID 
NO. 3. 

30 Fig. 4 shows a nudeotide sequence of non-A non-B hepatitis spedfic-cDNA which is encoded in a done 
010-14 and an amino acid sequence deduced from the sequence. These sequences are equivalent to SEQ 
ID NO. 4. 

Figs. 5 shows a nudeotide sequence of non-A non-B hepatitis-specific cONA which is encoded in a done 
010-15 and an amino acid sequence deduced from the sequence. These sequences are equivalent to SEQ 
35 ID NO. 5. 

Fig. 6 shows a nudeotide sequence of non-A non-B hepatitis spedfic-cDNA which is encoded in a done 
010-16 and an amino acid sequence deduced from the sequence. These sequences are equivalent to SEQ 
ID NO. 6. 

Fig. 7 shows a nudeotide sequence of non-A non-B hepatitis spedfic-cDNA which is encoded in a done 
40 01 0-1 7 and an amino acid sequence deduced from the sequence. These sequences are equivalent to SEQ 
ID NO. 7. 

Figs. 8 shows a nudeotide sequence of norvA nor>-B hepatitis-spedfic cONA which is encoded in a done 
010-18 and an amino acid sequence deduced finom the sequence. These sequences are equivalent to SEQ 
ID NO. 8. 

45 Fig. 9 shows a nudeotide sequence of non-A non-B hepatitis-specific cDNA which is encoded in a done 
010-19 and an amino acid sequence deduced from the sequence. These sequences are equivalent to SEQ 
ID NO. 9. 

Figs. 1 0 shows a nudeotide sequence of non-A non-6 hepatitis-spedfk: cONA which is encoded In a done 
010-21 and an amino acid sequence deduced from the sequence. These sequences are equhmlent to SEQ 
60 ID NO. 10. 

Fig. 1 1 shows a nudeotide sequence of norvA non-B hepatitssnspedfic cONA which is encoded in a done 
010-22 and an amino acid sequence deduced from the sequence. These sequences are equivalent to SEQ 
ID NO. 11. 

Fig. 12 shows a nudeotide sequence of non-A non-B hepatitis-spedfic cDlsiA which is encoded in a done 
55 010-23 and an amino acid sequence deduced from the sequence. These sequences are equivalent to SEQ 
ID NO. 12. 

Fig. 13 shows a nudeotide sequence of non-A non-B hepatitis-spedfic cDNA which is encoded in a done 
010-35 and an amino acid sequence deduced from the sequence. These sequences are equivalent to SEQ 
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ID NO. 13. 

Fig. 14 shows a nucleotide sequence of non-A non-B hepatitis-specrfic cDNA which is encoded in a done 
C11-C21 and an amino acid sequence deduced from the sequence. These sequences are equivalent to SEQ 
ID NO. 14. 

5 Fig. 15 shows a nudeotide sequence of non-A non-B hepatitis-specific cONA which is encoded in a done 

C10-E12 and an amino acid sequence deduced from the sequence. These sequences are equivalent to SEQ 
ID NO. 15. 

Fig. 16 shows a nudeotide sequence of non-A non-B hepatitts-specrfic cONA which is encoded in a done 
C10-E13 and an amino acid sequence deduced from the sequence. These sequences are equivalent to SEQ 
10 ID NO. 16. 

Fig. 17 shows a nudeotide sequence of non-A non-B hepatitis-specific cDNA which is encoded in a done 
C10-E24 and an amino add sequence deduced from the sequence. These sequences are equivalent to SEQ 
ID NO. 17. 

Fig. 1 8 shows a nudeotide sequence of non-A non-B hepatitis-specific cDNA which is encoded in a done 
f5 C10-E15 and an amino add sequence deduced from the sequence. These sequences are equh^ent to SEQ 
ID NO. 18. 

Ftg. 19 shows a flow sheet for the construction of an expression plasmid Trp TrpE C1 1-7. 

Fig. 20 shows a flow sheet for the construction of an expression plasmid Trp*TrpE*C1 1-C21 . 

Fig. 21 is a photograph showing the results of westem blotting analysis of an expressed product, TrpE-d 1- 
20 7. with serum from a normal person or non-A non-B hepatitis patient, wherein the antigens used are a purified 
antigen in A, an extract of expressed cells in B, and an extract of nor>^pressed cells in C. 

Fig. 22 is a photograph showing the results of westem t>lotting of an expressed product, TrpE CI 1-C21. 
with sera (A, B) from two normal persons or norv-A non-B hepatitis patients. 

Fig. 23 is a graphical representation of the EUSA-detenmined positive numt)ers in Table 4. 

25 

DETAILED DESCRIPTION OF THE INVENTION 

Many aspects and advantages of the present invention will be made apparent to those skilled in tiie art by 
the following detailed description about preferred embodiments of the invention. 
30 The present invention provides a specified DNA fragment comprising a base sequence which encodes a 
non-A non-B hepatiti&^pedfic antigen polypeptide from a non-structural or structural protein of the non-A non-6 
type hepatitis virus. 

In preparation of the DNAfragnr>ent of the present invention, it is characterized that variously mutated genes 
of pathogenic viruses were dlrectiy collected from fresh blood plasnna pools of a number of non-A non-B hepati- 
35 tis patients. More particularly, ttte preparation comprises the steps in which total RNA nwlecdes Induding non- 
A non-B hepatitis virus RNA are tedated from the blood plasma pool, cDNAs are eynttiesized based on the 
isolated RNA molecules by the well-known random primer method, and then the cONAs obtained are Incorpo- 
rated Into X phage to prepare a cDNA library. The cDNA library is subsequenfly immunoscreened using sera 
from a non-A non-8 hepatitis patient to obtain the DNA fragments of Interest Thereafter, using the resulting 
40 DNA fragments as probe, cDNA libraries obtained firom the blood plasma firom several chronic non-A non-B 
hepatitis patients were subjected to hybridization assay in order to Isolate a cONA whk:h has different horTK)logy 
from ttie known counterparts and which is ^)ecific for the non-A non-6 hepatftis patient 

Such a process makes it possible to provMe the viral antigens whk:h are markedly useful for the diagnosis 
of non-A nor>-B hepatitis patients carrying the variously mutated viruses and for the improvement of detection 
45 accuracy of the hepatitis viruses contained in blood for transfusfon whteh %vas collected from many latent car- 
rlera carrying non-A non-B type hepatitis viruses. 

The following describes the present Invention in detal with regard to the preparation of cDNA library, iso- 
lation and sequencing of DNAiragments, expression and isolation of potypeptMea, and their application to diag- 
nosis of non-A non-6 hepatitis using enzyme-linked imn>unosort>ent assay (EUSA) or PGR: method. 

50 

Preparation of cDNA library 

Ftrstiy. cell debris is removed from each of fresMy collected blood plasma samples of several norvA non-6 
hepatitis patients by centrifugation and the resulting supernatant is again subjected to centrifugation at a higher 
55 speed of rotation to obtain a pellet The peRet is subjected to an equRibrium density gradient centrifugation using 
cesium trifluoroacetate to isolate total RNA as a predpitate. and the total RNA is purified by phenol/chloroform 
extraction and etiiand predpHation. 

By the method of Gubler and Hoffrnan using random primers, cDNA is synth^ized from the above RNA 
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fraction. The cDNA is methylated by treating it with a DNA methylase (for example, EcoRI methytase), connec- 
ted with a DNA linker (for example. EcoRt linker) or DNA adapter (for example, EcoRI adapter), and then doned 
into a cloning vector such as X phage (for example, Xgt10 or Xgt1 1) to prepare a cDNA library. 

5 Isolatton and sequencing of DNA fragments 

Next Escherichia colt is infected with the X phage cDNA library and cultured on an agar plate to form pla- 
ques. These plaques are transferred on a nitroceltutose filter, and subjected to blocking followed by immunosc- 
reening using a non-A non-6 hepatitis serum in order to detect positive clones. Alternatively, to improve efft- 

10 ciency of the screening, each positive done obtained is doned into a doning vector such as ptasmid and a 
^-labeled DNA probe is prepared by random primer technique, and then positive plaques are detected from 
the aforementioned cDNA library using the probe. 

Eighteen dones in total were obtained by the above procedure and designated as C1 1-7. C10-11 . C10-1 3, 
C10-14. C10-15. C10-16. C10-17. C10-18. 010-19, C10-21, C10-22. C10-23, C1(V35, C11-C12. C10-E12. 

f5 C10-E13. C10-E24andC10-E15. 

A cDNA sample is obtained from X phage DNA of each 18 dones in a traditional manner and digested with 
appropriate restnction enzymes such as EcoRI and Ba/nHI. Each cONA fragment obtained is purified by agar- 
ose gel electrophoreste, incorporated into a sequendng vector (M13 phage), and then subjected to the dkjeoxy 
chain termination nrwtiiod [Sanger et at Proc, Nat!, Acad. Scr.. USA, 74, 5463 (1977)] in order to detemiine a 

20 base sequence of each cDNA fragment. 

Nudeotide sequences of these dones and deduced amino add sequences are shown in Figs. 1 to 18 and 
in a Sequence Listing which will be described later as SEQ ID NOs. 1 to 18. That is, the SEQ ID NOs. 1 to 16 
respectively represent the nudeotide and deduced amino add sequences detenmined from dones C11-7, C10- 
11, C10-13, C10-14, C10-15, C10-16, C10-17, C10-18, C10-19, C10-21, C1&-22, C10-23, C10-35. C11-C12, 

25 C10-E12, C1(VE13, C10-E24 and C10-E15. Also, ttie base pair (BP) number of ttieir DNA fragments is 763 
BP, 615 BP. 771 BP, 630 BP, 1426 BP. 655 BP, 315 BP, 911 BP, 489 BP, 1076 BP, 284 BP, 641 BP, 432 BP. 
369 BP, 932 BP, 559 BP, 276 BP and 742BP, respectively. 

All the 18 clones contained a continuous open reading frame but with no termination codon. 

Analysis of genomic RNA has revealed that hepatitis C virus (HCV) Is a dass of virus simflar to tiie genus 

30 f7avfv/fussuch as Japanese encephalitis virus [Pmtein, Nudeic Add and Enzyme (Japan), ^(12), 21 17- 2127 
(1 990)]. From the comparison of homology between ttie reported gene and polypeptide of FlavMms and those 
of ttie present invention, it vras found tttat dones C1i-C12, C10-E12, C10-E13, Cia-E24 and C10-E1 5 encode 
a stmctural protein of the non-A non-B type hepatitis virus. More particuariy, done C11-C12 is a gene which 
encodes the core of non-A non-B hepatitis virus, and dones C10-E12, 010^13. C10-E24 and C10-E15 are 

35 genes encoding a region from the latter half of the vims core to the env or a regk>n downstream from the env. 
Other dones were found to be genes encoding non-etructuial proteins of the virus. 

The nudeotide sequences of the above 18 dones aruJ the anrtino ackj sequences translated along the open 
reading frantes showed homok>gles witii those of hepatitis C virus (HCV) reported by Houghton et al (EP-A- 
318,216, 1988). In ottier words, dones C11-7, C10-16, 010-17, C10-18, C10-19. C10-21. 010-22 and 010-23 

40 showed relatively high honrK>logies wttti HOV: 80 to 82% homology at nudeic add level and 91 to 94% at amino 
add level. In addition, these dones showed more higher homologies with the sequence J1 reported by Miyam- 
ura et aL {Nuc. Ad, Res., 17, 10367 - 10372, 1989): 85 to 95% homology at nudeic add level and 87 to 100% 
at arrtino add level. These clones were dasstfied as group 1 because of high homology In their overiapped 
portion. On the contrary, dones 010-11, 010-13, 010-14, 010-15 and 010-35 showed low homologies when 

45 compared to ttie nudeotide and amino add sequences of HOV and J1, Le., 69 to 70% hontology at nudeic 
add level and 75 to 80% at amino add level, they were ttierefore dassified as group 2. 

In addition, when the 369 BP nudeotide and deduced 123^ino add sequences, indicated as SEQ ID 
NO. 14, for ttie 01 1-021 ddna encoding a structural protein of ttie virus were compared witti ttie portions over- 
lapped witti HOV reported by Hoi^hton et al (WO 90/1 1089), a nudeic add homology or81.8% and an amino 

50 add homology of 87% were found. Also, when compared witti HOV dones, H0-J1 and HO-J4, obtained from 
a Japanese patient (Okamoto et a/.; Japan J. Exp. Med., 3, p. 167 - 177, 1990), homologies of 82.1% and 
82.7% at nucleic add level and 87.8% and 89.4% at amino add level were shown. Since ttie same regions 
among ttie reported ttiree dones (HOV by Houghton ef a/, and H0-J1 and H0-J4 by Okamoto et a!.) have high 
homologies of 9Z1 to 97.6% at nudeic add level and 95.5 to 96.7% at amino add level, it has been found ttiat 

55 ttie done 01 1 ^21 obtained by the present inventora has a certain distance from the reported dones in terms 
of homdogy and ttierefore is a different group of viral gene ttierefrom. The remaining 4 dones. 01 0-E1 2, 01 0- 
E13 and 010-E15. showed homdogtes of 83 to 93% at nudeic acid level and 82 to 95% at amino acid level 
when compared witti ttie HCV. HO%J1 and HO>l4. while O10-E24 showed around 63% of homdogy at nudeic 
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acid level and around 60% of homology at amino acid level. 

However, no homology was found either at nucleic acid level or amino acid level, when the DMA fragnwnts 
of the present invention were compared with any DNA fragment encoding non-A non-B hepatitis antigens which 
have been disclosed in Jpanese patent Application Laying-Open (KOKAI) Nos. 89/2576 and 89/124387. 
5 Consequently, the dones 010-11. C10-13. C10-14. C10-15. C10-35, C11-C12. and C10-E24 have low 

homologies with the reported clones both at nudeic acid and amino acid levels. Other clones are also distiguish- 
able from the reported dones. 

Therefore, the present invention provides a DNA fragment comprising a base sequence which encodes a 
non-A non-B hepatitis-specrftc antigen polypeptide, said polypeptide consisting of the whole or a part of the 
10 amino acid sequence which is encoded along the open reading frame and represented by any one of the SEQ 
ID NOs. 1 to 18. 

Naturally, the base sequences according to the present invention indude any other base sequence which 
comprises other codons corresponding to each amino add. 

Anx>ng the aforementioned dones. C1 1-7. CI 0-1 1 . CI 0-1 3. CI 0-1 4. C1 0-1 5, C1 0-1 6. CI 0-1 7. C1 0-1 8 and 

15 C10-19 weretransfomned intoE. co// HB101 strain and deposited on July 6. 1990 with Femnentation Research 
Institute. Agency of Industrial Science and Technology. 1-3 Higashi 1<hon>e, Tsukuba-shi. Ibaraki-ken, 305. 
Japan, respectively as E. co// HB101/C11-7 (Accession Number PERM P-11589). E. co// HB101/C10-11 
(PERM P-11581). E. coli HB101/C10-13 (PERM P-11582), E co// HB101/C10-14 (PERM P-11583), E col! 
HB101/C10-15 (PERM P-11584). E. coll HB101/C10-16 (PERM P-11585). E coli HB101/C10-17 (PERM P- 

20 11586). E coll HB101/C10-18 (PERM P-11587) and E co// HB101/C10-19 (PERM P-11588). These deposi- 
tions were subsequently converted on June 13. 1991 to an International deposition under Budapest Treaty by 
the same depositary institutton as an international depositary authority set forth in Budapest Treaty to be given 
the following new Aocesston Numbers: 

25 E. call HBIQI AcceSflion WQ, (FERM BP") 



Clone 


Cll-7 


3442 


Clone 


ClO-11 


3434 


Clone 


ClO-13 


3435 


Clone 


ClO-14 


3436 


Clone 


ClO-15 


3437 


Clone 


ClO-16 


3438 


Clone 


ClO-17 


3439 


Clone 


ClO-18 


3440 


Clone 


ClO-19 


3441 



45 Also, dones 011-021, C10-E12, C10-E13, C10-E24 and C10-E15 were transfbrmed Into £ co// JM109 
strain and deposited on December 11.1 990 with Pennentation Research Institute. Agency of Industrial Science 
and Technology, the same address, respectlvety under Accession Numbers PERM P-11892, PERM P-11894. 
PERM P-11895, FERM P-^96 and PERM P-11897. These depositions were also subsequently converted 
on June 1 7, 1 991 for done 01 1 -01 2 and on June 1 3, 1 99 1 for other clones to an International deposition under 

so Budapest Treaty in the same way. The following new Aocessi<»i Numbera were given: 



55 
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g. coli ,TMinQ Accession Nn . ^F ERM BP-> 



10 



Clone 


C11-C21 


3450 


Clone 


C10-E12 


3444 


Clone 


C10-E13 


3445 


Clone 


C10-E24 


3446 


Clone 


C10-E15 


3447 



As described in the foregoing, the DMA fragments according to the present invention are different from any 
15 other prior DMA fragment Though non-A non-B hepatitis vims is generally divided into two classes, namely 
groups 1 and 2, on the basis of the comparison of homology between the clones encoding a nonstructural reag- 
ion of the hepatitis virus, there is a possibOity of existing an intennnediate group or even a third group because 
the virus is very susceptible to mutation In its host cells. It may be accordingly drfFicult to correctly diagnose all 
the non-A norvB hepatitis patients using an antigen protein prepared from only one kind of DNA fragment In 
20 order to overcome such a problem and to bnprove an efficiency of the diagnosis, it is necessary to establish 
such a useful process for the preparation of Oh4A that a number of effective dones can easily be obtained, and 
to use several types of dones in combination in diagnosis 

Expression of non-A non-B hepatitis specific antigen polypeptide 

25 

The present invention also provides an expression vector which is constructed by introducing the above- 
mentioned DNA fragment Into a doning site downstream of a promoter gene in a vector. 

Any conventional vector nr^ay be used such as plasmid, phage or the like. An expresston vector may be 
constructed by the well*known techniques in the art The following describes some processes for constructing 
30 the expression vectors of the Invention. 

Construction of expresston plasmM Trp-TrpE'C11-7: 

A flow sheet for the construction of the expression plasmid Trp-TrpE-C1 1 -7 is shown in Rg. 19. 

35 Firstiy, a plasmkl pUC-C1 1 -7 DNA obtained by incorporating the done C1 1 -7 into pUC1 1 9 is digested with 
restriction enzymes SamHI and Seal, and the resulting SamHl - Seal fragnDent is Isolated by agarose gel 
etectrophoTBSis and then purified by a glass powder technk^ue. Separately from this, an expresston vector 
Trp TrpE DNA digested wtth SamHl and Seal, treated with a bacterial alkaline phosphatase (BAP), and then 
extracted with phenol. The aqueous layer obtained is sut>sequentty subjected to ethanol predpltation to obtain 

40 a treated vector DNA. By connecting the vector DNA with the aforementioned C1 1 -7 DNA fragment in the pre- 
sence of T4 DNA llgase, the expresston plasmid Trp TrpE-C11-7 is obtained in which the DNA encoding the 
non-A non-B hepatitis^specffto antigen is located downstream of a promoter so that transcription of the DNA 
can be controlled tyy the pnxnoter. 

45 Construction of expresston plasmto Trp-TrpE-C1 1-021: 

A flow sheet for the oonstructton of the expresston plasntto Trp-TrpE C1 1 -C21 to shown In Fig. 20. 
FlTBtiy, a DNAfragnteriFdontalnlng a stop codon In its 3* terminal is prepared from a plasntkj pUC-CI 1-C21 
DNA whtoh is obtained by Incorporating the C1 1^1 done into pliC1 1 9, by a gene ampliftoation method (PGR) 

60 using two primers (5'-TTACGAATTCATGGGCACGAATCCT-3' and S'-TTAATCGATGACCTTACCCA- 
CATTGCG^O- By ligating ttie thus-prepared DNA fragment witti pUC1 18 whtoh is predigested witti Smai, a 
plasmto pUC118d1-C21-S/na is obtained. This plasmkl is then digested with EcoRI and Sa/nHI, and the 
resulting DNA fragment is isolated by agarose gel electrophorests and then purified by glass powder technique. 
Separately from tills, an expresston vector Trp-TrpE DNA (Japanese Patent Application No. 90/180889) is 

55 digested witii BamHI and EcoRI, treated with a bacterial alkaline phosphatase (BAP), and ttien extracted witti 
phenol. The aqueous layer obtained is suttsequentiy subjected to ehand predpitation to obtain a treated vector 
DNA. By connecting the vector DNA with the aforementioned C1 1 -C21 DNA fragment by the action of T4 DNA 
ligase in a ligation buffer sdution, the expr^ston plasnr^ Trp TrpEd 1-C21 is obtained in whtoh the DNA-en- 



7 



EP 0 468 657 A2 



coded polypeptide from a structural protein of the non-A norvB hepatitis virus is located downstream of a pro- 
moter so that transcription of the DNA can be controlled by the promoter. 

Other clones can also be made into corresponding expression plasmids by treating each clone with approp- 
riate restriction enzymes and introducing the treated fragment into an expression vector. 
5 When a procaryote is used as the host cell, a promoter eligible for use in the present invention may be 

selected from promoters originated from E, coh\ phage and the like, such as tryptophan synthase operon (trp). 
lactose operon (lac), X phage Pl, X phage Pr and the like. When an eucaryote such as yeast is used as the 
host cell, promoters for 3-phosphoglycerate kinase and other glycolysis-related enzymes (Holland et al; 
Biochemistry, 17: 4900. 1978) may be useful. Though not always required, a transcription tenmination factor 
10 may preferably be located in the expression vector. 

The vector may further contain a marker sequence, such as an ampicillin or a tetracycline resistance gene, 
which makes it possible to effect a phenotype setectkin in transfomned cells. 

The present invention also provkies a transfonnant whch is obtained by inbtxiudng the expressk>n vector 
of the invention into a host celt. Microorganisms used commonly in this field, such as E. coii, 6. subUlis, a yeast 
15 strain and the like, may be used as a host cell. 

Transformation may be effected by any usually used means for the inoorporatx>n of an expressk>n vector 
into host cells. When a bacterium (for example, £. co/r) is used as host cell, a direct incorporation technique 
with the use of calcium chloride (Mandel, M. and Hlga, A; J. Mol. Bio., 53^ 159 - 162, 1970) way be employed. 

In addition, the polypeptide of the present Inventton may be produced by inoculating and culturing a suitable 
20 host cell carrying the expresston vector in an appropriate medium such as ampicillin-containing 2YT medium 
and then propagating expressnn cells by subculturing them In an ampicillin-containing phosphate medium. 

Productton and purification of recombinant non-A non-B hepatitis-specific antigen polypeptMe 

25 The present invention also provMes a process for producing a non-A non-B hepatitis-specific antigen 
polypeptkje. which comprises the following steps of: 

constructing a repticable expression vector which can express the aforementioned DNAfragnwntof the 
present inventk>n in an appropriate host cell; 

obtaining a transfonmant by incorporating said expression vector into the host cell; 
30 producing a recombinant potypeptkie by culturing saki transformant under such conditions that saM DNA 

fragment can be expressed; and 

recovering said recombinant polypeptide. 
The crude polypeptide product from host cells may be purified by disintegration of the host cells, tor example 
by ultrasonk: disintegratk>n. subjecting the disintegrated cells to centrifugatk»n to obtain an insoluble fractton 
35 containing a fused potypeptkle between TrpE as signal peptkle and a polypeptkto encoded by cDNA syn- 
thesized from a non-A non-B hepatitis virus RNA, extracting the fused polypeptkle in a soluble fbrm with a urea- 
containing bufVier, and then purifying the extracted polypeptkle by subjecting it to an ton exchange column 
chromatography (S-Sepharose, for example). 

Accordingly, the present Inventton also provkJes a recombinant non-A norvB hepatitis-epedfic antigen 
40 polypeptkie obtained by such a expresston process, saki polypeptMe consisting of the whole or a part of the 
amino add sequence represented by any one of the SEQ ID NOs. 1 to 18. 

The term 'Recombinant non-A nort-B hepatitis-spedfic antigen polypeptMe" as used herein is intended to 
Indude a pdypeptkje itself which is obtained by expressing In a vector a DNA fragntent encoding a non-A norvB 
hepatitis-specific antigen polypeptkle, and a fused polypeptide obtained by fusing said polypeptide with other 
45 peptide such as a signal peptkto. 

Appllcatton to dlagnosb of non-A non-B hepatitte 

The expressed polypeptkie of the present inventk>n was subjected to SDS-polyacrylamkJe gel 
so electrophoresis and then alk>¥ved to perform antigen^ntibody reactk>n wtth each two serum sonples from nor- 
mal persons or non-A norvB hepatitis patients by means of western blotting, whereby this polypeptkle reacted 
strongly with only the patient's sera as shown In Figs. 21 and 22. It was confirmed therefore that the expressed 
polypeptkie functions as a norv-A non-B hepatitis-spedfic antigen. 

Accordingly, the present inventton also provkies a method for immunological detectkm to detect an anti- 
55 body directed against the non-A non-B hepatitis virus antigen ,whk:h comprises the fdkming steps of. 

incubating a sample possibly containing an anti-non-A non-B hepatitte virus antibody together with at 
least one recombinant nor>-A non-B hepatitis-spedfic antigen polypeptide of the present bvention under such 
conditions that the antigen s capat>le of reacting immunologically with the antibody; and 
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detecting an antigen-antit)ody complex. 
Diagnostic effects (positiveness) of tfie expressed polypeptide TrpE-C11-C21 obtained by expressing the 
expression plasmkJ Trp TrpE -011-021. another expressed polypeptide TrpE 0 11-7 obtained by expressing the 
corresponding expression plasmid Trp-TrpE 011-7, and an assay kit of Ohiron Oorp. (ORTHO HCV Ab ELISA 

5 kit) were examined by the conventional enzyme immunoassay through the reaction of the above expressed 
antigens with a serum sample from a patient who has been diagnosed dinicalty as being non-A non-B hepatitis. 
As the results, positiveness of the kit of Ohiron Oorp. was found to be 69.7% (23/33 cases) while the TrpE O1 1 -7 
which belongs to group 1 showed a positiveness of 78.8% (26/33 cases). In the case of the expressed polypep- 
tide TrpE O1 1-021 . it showed a positiveness of 84.8% (28/33 cases) which is higher than the case of the Ohi- 

10 ron*s kit. When the expressed polypeptide TrpE 011-7 as a member of group 1 and the TrpE O1 1-021 as a 
member of group 2 were used in combination, the positiveness increased to 93.9% (30/31 cases; see Table 1 
and Fig. 23). 

Therefore, according to an embodiment of the present inventk>n. there is provided a combination of the 
group 1 and group 2-relating expressed polypeptides as a hepatitts-specific antigen pdypeptkie for use in the 
15 immunological detection. 

The present invention further provides a method for gene amplrficatksn which comprises amplifying a non-A 
norv-B hepatitis virus gene using sense and/or antisense sequence synthesized on the basis of the DNA sequ- 
ences of the present invention. 

As the synthetic base sequence for POR primer, the following single strand DNA sequences may be env 
20 ployed: 

5'-<3GATAGACCGGTGACTTTGA-3'(sense. SEQ ID NO. 19); 
5'-TGOATGCACGTGGOGATGTA-3'{antisense, SEQ ID NO. 20); 
5'-GATGOOOAOTTOOTCTOOOA-3'(sense, SEQ ID NO. 21); and 
5'-GTOAGGGTAAOOTOGTTGGT-3'(antisense. SEQ ID NO. 22), 
25 said sequences being sense or antisense of the partial base sequence represented by the SEQ ID NO. 5 for 
the fomier two primere and by the SEQ ID NOs. 2, 4. 5 or 1 3 for the latter two primere. These specified printers 
are also within a scope of the invention. 

The single strand DNA sequences may be synthesized by the usual methods such as phosphorous ackJ 
nnethod, phosphotrlester method, solkj phase method and the like, though the use of a DNA synthesizer is most 
30 convenient 

When used as a POR primer, the above single strand DNA sequences show higher spedfk:ity for the group 
2 virus genes than for the group 1 virus genes (see Tables 2 and 3). 

Therefore, the present invention also provides a method for detecting the genes firom the norvA non-6 type 
hepatitis virus in a fluid sample such as serum, which comprises the following steps of: 
35 isolating RNA from the sample, 

synthesizing cONA by treating the obtained RNA with a reverse transcriptase, 
subjecting the obtained cDNA to polymerase chain reaction using at least one the above-mentioned pri- 
mer; 

detecting an amplified non^A non-B type hepatitis virus gene. 
40 The present Invention further provides use of the expressed polypeptkjes or single strand DNA sequences 
for POR primer of the present Invention in the detection of the non^A non-B hepatitis virus. 

The following examples are given to further illustrate the present invention in detail, but it is not intended 
to lin^ the invention thereby. 

45 Exantple 1 

Preparation of cDNA libraiy from blood plasma of non-A non-B hepatitis patient 

A cDNA library was prepared using XgM 0 and Xgtl 1 phages after preparing an RNA fraction In the following 
50 manner from fresh blood plasma pools obtained from several Japanese patients of chronic stage non-A norvB 
hepatitis. 

Five liter of blood plasma was diluted with the equal volume of 50 mM Tris-HOi (pH 8.0) containing 1 mM 
EDTA, cell debris in the diuted sample was renrKyved by centrifiigation at 3,500 g for 20 minutes and then the 
resulting supernatant was again subjected to centrifugation at 45,(X)0 rpm (atKxit 100,000 g) for 4 houre at a 
55 temperature of 4"^ to obtain pellet The pellet was dissolved, according to the conventional procedure, in 6M 
guantdium thiocyanata as a protein denaturating agent, layered over a solution of cesium trifluoroacetate. and 
then subjected to centrifiigation using Beckman SW50 rotor at 33.000 rpm for 16 houre at a temperature of 
20^0. The resulting pellet was dissolved in 10 mM Tris-HOI (pH 7.5) containing 1 mM EDTA and extracted twice 
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with a sotvent system of 1:1 phenolxhloroform, afterwhich ttie organic layer was mixed with 1/10 volume of 5 
M NaCI and 2.5 volunr)es of ethanol. After standing the mixture for 2 hours at -20^C, it was centrrfiiged at 15,000 
g for 20 minutes and the pellet was then dissolved in diethyl pyrocarbonate-treated water to use as an RNA 
sample. 

5 In accordance with the method of Gut)ler and Hoffman. cDNA was synthesized from the thus obtained Rh4A 

sample by means of random primer technique using a commercially available kit (from Anwsham or BRL). The 
cDNA was subsequently treated with EcoRI methylase. ligated with an EcoRI linker or an EcoRI adapter and 
then doned into the EcoRI site of XgtIO and Xgt 11 phages. The cONA library thus prepared contained 10® to 
10^ PFU of recombinant phages in average. 

10 

Example 2 

Isolattori of non-A non-B hepatitis-specific cDNA 

IS An attempt was made to isolate cONA specific for non-A non-B hepatitis from the cDNA library prepared 
in Example 1, by immunoscreening and hybridization assay. 

Firstly. immunoscTBentng of Xgtl 1 library was carried out using two serum samples from non-A non-B 
hepatitis patients which are negative for HBc and HBs antibodies and which contain antibodies specific for the 
hepatitis-caustng virus. Immunoscreening was perfbnmed in the usual way by examining specific reactton of a 

20 p-galactosidase-fused recombinant peptide with a serum sample of non-A non-B hepatitis (to be refenred to as 
"NANBH" herein after) patient 

Cells of E. CO// Y1090 strain were mixed with Xgtl 1 cDNA library at a predetermined ratio, plated on an 
agar medium at an appropriate density, and then incubated at 43^*0 for 3 hours to form plaques. Next, the agar 
plate was covered with a Hyt)ond-C nitroce!luk)se filter which has been soaked with 10 mM IPTG and the fil- 

25 ter-covered plate was incubated again at 37*K^ for 3 hours to induce expression. Sut)sequently, the nitrocellul- 
ose filter was subjected to blocking using 3% gelatin solutk)n, reacted with a serum sample of NANBH patient 
overnight at 4*^0, and then, after washing, reacted with a peroxklase-labeled anti-human IgG (goat antibody). 
A positive signal was found when the resulting fater was reacted with a mbcture of diaminok>enzidine and H2O2. 
This done, C11-7. did not react with HBc and HBs antibodies. 

30 Next, in order to improve efficiency of the screening, the done C1 1-7 was re-doned Into pUC1 19 and made 
Into a probe by random primer method. Using the probe, Xgt10 cONA library was screened by nr^eans of hyk>- 
ridization assay. Screening was carried out according to the conventranal method by plating 5x10* PFU of 
recombinant phages with E. coll C-600 hfl(-) on an L-plate (150 mm dish). When plaques appeared after over- 
night Incubatton of the plate at 37^C, the plate was stored at 4^*0 for 1 hour and thereafter the plate was covered 

35 with a Hybond-N filter for a period of 30 secorxJs. The resulting filter was superposed for 1 minute on a filter 
prewetted with a denaturating solution (0.5 M NaOH and 1.5 M NaCI)^ soaked for 5 minutes in a neutralizing 
solutkMi (0.5 M Tris-HCI pH 7.0 and 1 .5 M NaQ), washed with 2 x SSC, and then dried. The fUter was subjected 
to UV-crosstinking by exposing it to UV rays (304 run) for 2 minutes. Thereafter, as described below, the result- 
ing fiter was subjected to screening by hyt)r1dizatk>n assay using a ^-labeled DNA probe which has been 

40 prepared by random primer nrwthod from the C1 1-7 done obtained by Immunoscreening with a serum from 
NANBH patient 

The filter was incut»ated overnight at 65^*0 In 1 x SSC, washed twtee with 1 x SSC at 65X (1 0 minutes for 
each) and then subjected to autoradiography at -70^ for the detection of positive plaques. Each positive plaque 
was transferred Into SM buffer and used as a phage stock. Clones ot>tained were used as martcer probe to carry 
45 out a series of screening. As the results, 13 dones In total were isolated and designated as C10-11, C10-13, 
C10-14. C10-15. C10-16, C10-17, C10-18, C10-19, C10-21, C1&-22, Cia.23 and C10-35. 

Example 3 ^ 

50 Selection Isolatkyi of group 2 non-A non-B hepatitis^pedfic cDNA 

A bbod plasma sample which can read only with C10-14 done was obtained by subjecting fresh blood 
plasma of a Japanese patient in a chronic phase of the non-A rran-B hepatitis to an EUSA-t)ased screening 
system, using expressed products of the group 1 cDNAdone C1 1-7 and the group 2 cONA done CI 0-14 iso- 
55 lated in Examples 1 and 2. This bk>od plasma sample was subjected to a gene amplification method (PCR 
method) using wen preserved primers of group 1 and those of group 2. PCR nrathod was carried out using Gene 
Amp^ (DNA Amplification Reagent Kit, Pwldn Elmer Cetus) under condittons of: DNA denaturatk>n, OS'^C for 
1.5 minutes; annealing. 55**C for 2 minutes: and DNA synthesis, 70®C for 3 minutes. Blood plasma santples in 
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which gene amplification was found only with the use of the group 2 phnners under these conditions were pooled 
for further use. An RNA fraction was prepared fronr> one liter of this fresh blood plasma sample in the same 
manner as in Example 1 . and a cONA litxary (referred to as "cONA library A" hereinafter) was constructed using 
Xgt10 and Xgt1 1 phages. The cONA library A contained 10® to 10^ PFU of recombinant phages in average. 

5 On the other hand* a cDNA library B was constructed using Xgt10 phage from five liters of fresh blood 

plasma samples which have been coliected as starting material from several patients of non-A non-B hepatitis 
and have not been subjected to the EUSA/PCR method, in the same manner as described above. The cDNA 
library B also contained 10® to 10^ PFU of recombinant phages in average. 

Cloning of non-A non-B hepatitis-spectfic cDNA from cDNA library A was carried out by immunoscreening 

10 in the same manner as in Example 2. and a positive plaque (done CI 1-C21) was obtained. The done C1 1-C21 
showed no positive reaction with HBc and HBs antibodies. 

In order to improve efficiency of the screening, the thus obtained done CI 1-C21 was re-doned into 
pUC1 19. digested with restriction enzymes, and then made into a ^^Aabeled probe by random primer labeling 
method in the same manner as in Example 2. Using the probe obtained, the cONA library B was screened by 

15 hybridization assay. After a series of the screening efforts. 4 clones were isolated and nanied C10-E12, C1 0- 
E1 3, CI 0-E24 and C1 0-E1 5. 

Example 4 

20 Sequencing of norvA non-B hepatitis-specific cDNA 

£. coii cells were infected with the Xgt1 1 or XgtIO phage of each of the 18 dones obtained in Examples 2 
and 3 to recover respective phage in a large quantity. DMA was extracted from the phage by the conventional 
alkali method, digested with a restriction enzyme EcoRI. BamHI or Kpnt. and the resulting DNA fragments were 

25 purified by agarose gel electrophoresis. Separately from this, sequendng vectors mp18 and mp19 of M13 
phage (Messing, J.; Methods in Enzymology, 101, 20 - 78) or pUC118 and pUC119 (Vietra^. and Messing, J.; 
Methods in Enzymology, 153. 3-11) were digested with a restriction enzyme EooRI, BamHI or Kp/^ to obtain 
linear vector fragments. The cDNAfragnoentand the vector Dh4A were linked together using T4 ligase In a buffer 
solution, and the resulting reaction product was incorporated into E. oofi HB101 or JM109 strain by transfbr- 

30 matton or transfectton. Resulting £ co// cells were cultured and DNA was recovered by alkali method. Nuc- 
leotide sequence of the DNA obtained was detenmined according to the dkJeoxy chain terminatkHi method of 
Sanger ef a/. 

The nudeotkle sequences of dones C10-11, C10-13, C10-14, C10-15. C10-16, C10-17. C10-18. C10-19, 
C10-21, C10-22. C10-23, C10^5, C10-C21, C10-E12. C10-E13, C10-E24 and C1Q.E15 and the amino acW 
35 sequences deduced from these nudeotide sequences are shown in a sequence table as SEQ ID NOs. 1 to 1 8 
and also in Figs. 1 to 18. 

On the basis of the comparison of homologies anoong these sequences and the nudeotide and deduced 
amino add sequences disdosed by Houghton efa/.(WO89/04669. PCTyjP9Q/500880) and Miyarmira eta!{Nuc 
Ad. Res., IL 10367.10372(1989)), dones C1 1-7, C10-17, C10-18, C10-19, C1(V21, Cia-22 and C10-23 

40 obtained in Example 2 were dassified as group 1 done as defined hereabove while dones CI 0-11, CI 0-1 3, 
CI 0-1 4, CI 0-15 and C1 0-35 were daastfled as group 2 dones. Evry of these 13 dones encoded non^structural 
protein of the non-A non-B type hepatitis virus. Moreover, done C10-C21 in Example 3 was dassified as group 
2 from the oomparteon of homology with the sequences described by Houghton et al (WO9(V11089) and 
OkanK>toetal(J^n«/. £xp.Med,60, 3, pp.167-177 (1990)). but dassfficatkHiofthe dones CI 0-E12. C10-E13, 

45 C10-E24 and C10-E15 bi Exan4>le 3 is not stfll dear. However, was found that these 5 dones encode the 
stiuctural protein of non-A non-B hepatitis virus from the comparison of homology with the reported genome 
of RaWv//us (Protein, Nudeic Add and Enzyme (Japan), ^{M), 2117-2127 (1990)). 

Examples 

50 

Expression and purification of polypeptMe encoded by non-A non-B type hepatitis vims cDNA 
(i) Constnjctton of expresston plasmM TrpTfpE C11-7: 

55 One of the dones isolated, C1 1-7, was expressed as a fused pdypeptkie with TrpE in E. ooli under the 
control of Trp promoter(see Fig. 19). 

Firstly, 1 ^g of a plasmki pUC-C1 1-7 DNA which has been obtained by incorporating the C1 1-7 done into 
pUC1 19 was digested by incubating it at 37"^ for 1 hour in 20 ^l of a restrictton enzyme reactton solutton (150 



11 



EP 0 468 657 A2 

mM NaCI. 6 mM Tris-Ha (pH 7.9). 6 mM MgD2* 1 ^ ""i^ BamH\ enzyme and 1 5 units of Seal enzyme]. There- 
after, a BamH\-Sca\ fragment of atKXit 700 bp was obtained by subjecting the resulting reaction solution to 0.8% 
agarose get electrophoresis, and the fragment was purified by glass powder method (Gene Clean Blo-101). 
One \ig of Trp TrpE DNA which is an expression vector was digested by incubating it at 37^C for 1 hour in 

5 20 ni of a reaction solution (150 mM NaCI, 6 mM Tris-HCI (pH 7.5). 6 mM MgCl2, 15 units of BamHI enzyme 
and 15 units of Seal enzynrte]. After adding 39 ^1 of water, the resulting reaction solution was heat-treated at 
70°C for 5 minutes, mixed with 1 ^1 (250 U/^l) of a bacterial alkaline phosphatase (BAP) and then incubated 
at 37*'C for 1 hour. The reaction solution was subsequently extracted with phenol, the aqueou layer was sut>- 
jected to ethanol precipitation followed by drying of the precipitate. One ^g of the BamHI-Scal-treated vector 

10 DNA obtained and the above C1 1-7 DNA fragment was added to 5 ^1 of 10 x ligase buffer [660 mM Tris-HCI 
(pM 7.5). 66 mM MgClz. 100 mM dithiothreitol and 1 mM ATP] and 1 ^1 of T4 DNA ligase (350 U/^l), and water 
was then added to the mixture to 50 ^1 of the final volume. Thereafter, the thus prepared mixture was incubated 
overnight at 16^C to complete ligation. 

E. cdi HB101 strain was transfonmed with 10 of the resulting reaction solution. Competent £. coli strain 

IS for use in the transfbmriation was prepared by calcium chloride technique [Mandel.M. and Higa A; MoL BioL, 
53; 159 - 162 (1970)]. The transformed E. co// strain cells were spread on an LB-plate (1% trypton. 0.5% yeast 
extracts. 0.5% NaCI and 1.5% agar) containing 25 \ig/m\ of ampicillin and incubated ovemight at 37*>C. One 
loopfiil of each colony grown on the plate was transferred Into a liquid LB medium containing 25 (ig/ml of ampicil- 
lin and cultured ovemight at 37^*0. Cells In 1.5 ml of the cultured medium were collected by oentrrfugation. and 

20 Miniprep of ptasmid DNA was carried out by alkali method (Maniatis et al; Molecular Cloning: A Laboratory Man- 
ual, 1 982). One ^g of the plasmid DNA obtained was digested at 37''C for 1 hour In 20 of a reaction sohjtk)n 
[150 mM NaCI. 6 mM Tris-HO (pH 7.5). 6 mM MgClz. 15 units of SamHI and 15 units of Seal]. Thereafter, the 
digested solution was subjected to agarose gel electrophoresis to obtain an expression plasmid Trp TrpE-d 1-7 
which can produce the 700 bp BamHI-Scal fragment This plasmid was transfbrmed into E. coli HB101 strain 

25 and deposited on July 6, 1990 with Fennentation Research Institute, 1-3 Higashi 1-chome, Tsukuba-shi, Iba- 
raki-ken. 305. Japan, under the Accession Number FERM P-11590 (nanr>ed E. coli HB101/Trp TrpE C11-7). 
This deposition was subsequently converted on June 13, 1991 to an intematk>nal deposition under Budapest 
Treaty t>y the same depositary Institution as an international depositary authority set forth in Budapest Treaty 
to be given the new Accessk)n Number FERM BP-3443. 

30 

(ii) Expression and purification of polypeptide encoded by done C1 1-7 : 

E. cdi HB101 strain transfbnmed with the expresston plasmkj TrpTrpE C11-7 was inoculated Into 3 ml of 
a Ik^uid 2YT medium (1.6% trypton, 1% yeast extracts and 0.5% Naa) containing 50 (ig/mt of ampicOlin and 

35 cultured at 37^C for 9 houre. One ml por^n of the cultured broth was inoculated Into 1 00 ml of a Itquki M9-CA 
n)ed(um (0.6% Na2HP04, 0.5% KH2PO4, 0.5% NaCI, 0.1% NKUO, 0.1 mM CaOx, 2 mM MgS04. 0.5% casamino 
add and 0.2% glucose) containing 50 \igM of ampkillln and cultured at 37X for 21 hours. A 16^ poftk>n of 
the resulting culture broth was then Inoculated Into 1.2 1 of the M9^ medium and cultured at 37"^. When 
turbklity at ODm of the culture broth reached 0.3, indde aoylate was added to a final ooncentratkMi of 40 mg/l, 

40 and the culturing was continued for addittonat 16 houre. Cells collected from the final culture broth by centrifii- 
gation were suspended In 20 ml of buffer A [50 mM Tris-HCI (pH 8.0), 1 mM EDTA and 30 mM NaCQ and the 
cell suspension was again subjected to oentrifugation to obtain 2.6 g of expressed cells. The thus obtained cells 
were suspended In 10 ml of the buffer A, disintegrated by ultrasonic treatment and then subjected to centrifu- 
gation to obtain an insoluble fraction containing a fused polypeptide of TrpE with a polypeptide which is encoded 

45 by the norvA non-B type hepatftis vims cDNA. The fused polypeptide in the insoluble fraction was sdubatzed 
and extracted using 1 0 ml (tf the buffer A containing 9 M urea. Thereafter, the solubQized extract was subjected 
to an S-Seph«ose Ion exchange column chromatography with an NaCt gradient of from 0 M to 0.5 M to purify 
ttie fused polypeptide ^ 

60 (ill) Construction of expression plasmid Trp-TrpE'C11-C21: 

The done CI 1-C21 was expressed as a fused polypeptide wttii TrpE in £ coi? under tt>e control of a pro- 
nrK>ter (see Fig. 20). 

Firatiy. 1 ng of plasmid pUC-C11C21 DNA which has been obtained by incorporating C11-C21 done into 
55 pUC1 19 was subjected to PCR method using two primers (5'-TTACGAATTCATGGGCACGAATCCT-3' and 5'- 
TTAATCGATGACCTTACCCACATTGCG-3'). PCR method was carried out using Gene Amp™ kit (DNA Ampli- 
fication Reagent Kit. Pertdn BnDer Cetus) under reaction conditions of: DNA denaturation. 95'*C for 1 .5 minutes; 
annealing. 50*'C for 2 minutes: and DNA synthesis, 70°C for 3 minutes. DNA fragments thus obtained were 
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separated by 0.8% agarose gel electrophoresis and purified by glass powder technique. Separately from this, 
pUC 118 was digested with a restriction enzyme Sma\ and then ligated with the DNA fragn>ent obtained by PGR 
method in a buffer solution containing T4 ligase to obtain a plasmid pUC1 1 8-C1 1 -C21 Sma. One ^g of the plas- 
mid DNA obtained was digested at SZ^'C for 1 hour in a restriction enzyme reaction solution [150 mM NaCI. 6 

5 mM Tris-HCI (pH 7.9), 6 mM MgCl2. 1 5 units of EcoRI enzyme and 1 5 units of SamHI enzyme]. Thereafter, the 
resulting reaction mixture was subjected to 0.8% agarose gel electrophoresis to isolate an EcoRI-BamHt frag- 
ment of about 380 bp which was then purified by glass powder technique (Gene Clean™. Bia-101). 

Next ligation and transfbnmation were carried out sut>stantiatty in the same manner as in the aforemen- 
tioned procedure (r) except that restriction digestion of the expression vector TrpTrpE DNA was carried out 

10 using EooRI and SamHI instead of fla/nHI and Seal. Thereafter, an expression plasmid TrpTrpE-d 1-C21 
which can produce the EcoRI-BamHI fragment of about 380 bp was selected by agarose gel electrophoresis 
purification. This plasmid was transformed into E. co//HB101 strain and deposited on December 11. 1990 with 
Fermentation Research Institute. Agency of Industrial Science and Technology, the same address, under the 
Accession Number PERM P-11893 (named £. cofi HB101/TrpTrpE C11-C21). The deposition was also sub- 

15 sequentJy converted on June 17. 1991 to an intemational deposition under Budapest Treaty by the same 
depositary institution as an intemational depositary authority set forth in Budapest Treaty to be given the Acces- 
sion Number PERM BP-3451. 

(iv) Expression and purificatton of polypeptide encoded by done C11-C21: 

20 

Expression and purification of a fused polypeptide were carried out substantially in the same manner as 
in the aforementioned procedure (11), except that the expression plasmid Trp TrpE C11-C21 obtained by the 
above procedure (iii) was used instead of Trp*TrpE*C1 1-7. 

25 Example 6 

Measurement of anti-non-A non-B type hepatitis virus antibody in semm from non-A non-B hepatatis patient 

(i) Measurenwnt by western blotting: 

30 

The expressed product obtained and purified in Example 5 was subjected In tum to SDS-polyacrylamide 
gel electrophoresis [Laemmli; Naturo, 277. 680 (1 970)] and to blotting on a nitrocellulose filter (Bio-Rad. Trans- 
blot) In usual way. The filter was biocked wttii a 3% gelatin solution and then reacted with each serum samples 
from normal persons or non-A non-B hepatitis patients. After washing, the resulting fater was reacted with a 

35 peroxidase-labeled human IgG (goat antibody). Thereafter, the filter was washed again and soaked in a solution 
containing diaminobenzkjine as reaction substrate to conftnm color development 

The results are shown in Pigs. 21 and 22. In Pig. 21, the expressed polypeptide TrpE*C11*7 (group 1) 
obtained In Example was used as antigen, and In Pig. 22, the expressed polypeptide TrpE-C1 1-C21 (group 
2) In Example 5-(lv) was used. In each case, no reaction was ot)served with a nonnal serum sample, but a strong 

40 reaction with a patient's aenm sample was found with a specific band. 

(ii) Measurement by enzyme-linked immunosort>ent assay (EUSA) : 

EUSA can be used as a means to make diagnosis of a large number of serum samples as co mpar ed to 

45 ttie case of western Uotting method. EUSA was canied out as follows: 

A purified antigen sample was diuted with PBS(-) to a oonoentratk>n of 5 (ig/nrl and fbced to a mioo-plate 
at 4^*0 or room temperature. After washing several times witii a washing solution, a diluted semm sample to 
be detected was added to tile resulting plate and incubated for 1 hourat37''C or room tentperature. After wash- 
ing, peroxidase-labeled antMiuman IgG (goat antibody) was added and incubated at d7X or room temperature 

50 to complete the reaction. After washing several times, 50 \s\ of a diamlnobenzldine solution was added and incu- 
bated at 37**C to develop color. Thereafter, the coloring reaction was stopped with 2 M H2SO4 and the odor 
was nneasured by a ookufmeter. 

Positive ratios in the case of the use of the expressed polypeptide antigens, TrpE C11-7 (group 1) and 
TrpE-d 1-C21 (group 2), <tf the present invention were compared with the case of the use of a commercially 

55 available kit of Chiron Corp. (Ortho HCV Ab ELISA Test). As shown in Table 1 . ttie use of the Chiron's kit resulted 
in 69.7% of the positive ratio, while positive ratios in the case of the use of the TrpE-d 1 -7 and TrpE-d 1-C21 
were 78.8% and 84.8%, respectively Moreover, the positive ratio increased to 98.9% (30 of 31 cases) when 
these two expressed pdypeptkJes of the present invention were used in combination (see Pig. 23). 
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Example 7 

Detection of non-A non-B type hepatitis virus group 2 gene in blood plasma from non-A non-B hepatitis 
patient by RT-PCR 

5 

RT-PCR was carried out as follows: 

To 100 ^1 of a blood plasma sample collected from a non-A non-B hepatitis patient was added 300 of a 
6 M GTC solution (6 M guanidine thiocyanate, 25 mM sodium citrate, 0.5% sarcosyl and 0.2 M 2-mercap- 
toethanol), and the mixture was stinred. To this were further added 40 ^ of 2 M sodium acetate (pH 5.2), 400 

10 ^l of phenol and 80 ^1 of chlorofomVisoamyl alcohol (49:1), and then thoroughly stirred. Aqueous solution layer 
separated from the mixture was mixed with isopropyl alcohol and then subjected to centrifugation. Synthesis 
of cDNA was carried out using the pellet as a source of RNA. For the cD^4A synthesis, an RNase inhibitor and 
a reverse transcriptase were added to a reaction solution containing 10 mM Tris-HCI. 0.01 % gelatin, 1 mM each 
d^>ITP, 4 mM MgCl2, 1 mM DTT and 100 pmole each primer, and the mixture was incubated at 37^C for 2 hours 

15 to complete the reaction. Then, PGR was carried out using the cDNA obtained. In order to increase sensitivity 
and spectfictty for the detection of bands, a two step PGR method was employed, that Is, first PGR using two 
primers (1st step PGR) and subsequent PGR using two primere which exist inside the first PGR product (2nd 
step PGR). For the PGR reaction, each amplification cyde was carried out using 100 (il of a reaction solution 
containing cONA. 10 mM Tris-HGI, 0.01% gelatin. 2 mM each dMTP, 1.5 mM Mga2 and 50 pmoU each primer. 

20 under reaction conditions of: denaturation. 94"^ for 1 .5 minutes; annealing, 50*^ for 2 minutes; and chain elon- 
gation, 70*'G for 2 minutes. The amplification cyde was repeated 35 times. Effects of several primers were 
evaluated. As the results, it was found that the group 2-specific DNA fragments are capable of being detected 
by the use of the following 4 primere: 
1st step PGR 

25 1*21 : 5'-GGATAGAGGGGTGAGTTTGA-3' 

1*22: 5'-TGGATGGAGGTGGGGATGTA-3' 
2nd step PGR 

kk26: 5'-GATGGGGAGTTGGTGTGGGA-3' 
1*27: 5'-GTGAGGGTAAGGTGGTTGGT-3' 
30 By applying these 4 prinrtera to the PGR method, a DNA fragment of 206 bp can be detected. As a control, 
primere were synthesized from the base sequence of J1 and detection of group 1 DNA fragnnents was 
attempted. Results of the PGR from blood plasnra samples of non-A non-6 hepatitis patient are shown in Table 
2. 

It was luiown that DNA fragnnents from the non-A non-B hepatitis virus can be detected by both the PGRs 
35 using group 1 primers (Le„ group 1 PGR) and group 2 priniere (i.e.. group 2 PGR), and therefor two samples, 
Nos. 3 and 5, which are considered to indude both groups 1- and 2-relating viruses were sequeneod for their 
viral genes. As shown in Table 3. wtten nudeotide sequences of DNA fragments obtained by group 2 PGR were 
compared with C10-13 which is a group 2 done, homologies of 85% and 88% were observed. Indicating effec- 
tive detection of group 2 genes. When these two nudeotide sequences were compared with ttte aforementioned 
40 groupl doneJI (Miyanmiraetal. supra), only 64.8% and 68% honriologies were observed. Results of the homol- 
ogy evaluation Indicate that the primere used in the group 2 PGR can selectively detect group 2 viral genes. 



45 



so 



55 
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Table 1 



Sample No 



TrpE -011-7 
(group 1) 



TrpE-Cll-C21 Kit of Chiron 
(group 2) Corp. (group 1) 
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15 



20 
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30 
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33 
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40 Note: NK and NP are negative controls. 
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Table 2 

Sample No. Group 1 PGR Group 2 PGR 





1 


+ 


— 
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10 
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+ 




4 . 
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— 
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15 
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— 
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+ 
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+ 




20 


10 


+ 






11 


+ 


+ 




13 




+ 


25 


42 




+ 




169 


+ 


+ 




260 








244 






30 


248 




+ 




NC 







35 



Table 3 



Sample No. 


Nucleotide homolocyy with clone ClO-13 


3 


85% 


5 


88% 



45 

As seen from the fofegoing examples, the present Invention has the following advantages: 
The cDNA sequences aOoording to the present invention are specific to non^A non-6 hepatitis, and polypep- 
tides which are produced ty Incorpor a ting these genes Into a protein expression system in microbial host cells 
so such as E. CO// can react immunologicatly with sera samples from a numt)er of non-A non-B hepatitis patients, 
wheret>y a Idt for diagnosing non-A non-B hepatitis is capable of preparing with maricedty high sensitivfty and 
Judging accuracy. Also, it is possible to malce diagnosis of this disease using said sequences as a prot>e directly 
or other probes with higher specificity synthesized on the basis of the sequences. In addition, not only diagnosis 
of the disease but also isolation of non-A non-B hepatitiSHspecific genes can be accomplished by employing a 
55 gene amplification mettiod (PGR method). 
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Segoeoce Listios 



10 



IS 



20 



25 



30 



35 



40 



45 



60 



SEQ ID NO:i 

SEQUENCE LENGTB:763 base pairs 
SEQUENCE TTPEiDucleic acid 
STRANDEDNESS:double 
topology: linear 

MOLECULE TYPE'.cDNA to geooiaic KNA 

CG CAG TCA TTC CAA GTG GCC CAT CTA CAC GCT CCC ACT CGC AGC GGC 47 
GId Ser Phe Gin Val Ala His Leu His Ala Pro Thr Giy Ser Gly 
15 10 IS 

AAG AGT ACT AAA GTG CCG GCT OCA TAT GCC AGC CAA GGG TAC AAG GTG 95 
lys Ser Tbr Ljs Val Pro All All Tjr Ala Ser Gla Gly Tyr L;s Val 

20 25 30 

CTC GTC CTC AAC CCG TCC GTT GCC GCC ACC TTA GGT TTT GCA GCG TAT 143 
lea Val Leu Asd Pro Ser Val Ala Ala Tkr Lea GIr Phe Gly Ala Tyr 

35 40 45 

ATG TCT AAG GCA CAT GGC ACC GAC CCC AAC ATC AGA ACT GGG GTA AGG 191 
Met Ser Lys Ala His Gly Thr Asp Pro Asd lie Are Thr Gly Val Arg 
50 55 50 

ACT ATC ACC ACA GGC GCC CCC ATC ACG TAC. TCC ACC TAC GGC AAC TTC 239 
Thr He Thr Tkr Gly Ala Pro He Thr Tyr Ser Tbr Tyr Gly Lys Pke 
(5 70 75 

CTT CCC GAC GCT GGT TGT TCT GGG CGC GCT TAT GAC ATC ATA ATG TGT 287 
Leo Ala Asp Giy Gly Cys Ser Gly Gly Ala Tyr Asp He He Met Cys 
80 85 90 95 

GAT GAG TGC CAC TCA ACT GAC GCG ACT TCC ATC TTC GGC ATC GGC ACG 335 
Asp GId Cys Bis Ser Tkr Asp Ala Tkr Ser He Leo Gly He Gly Tkr 

iTD 105 110 

GTC CTG GAC CAA GCG GAG ACG GCT GGA GCA CGG CTC GTC GTG CTC GCC 383 
Val Leo Asp Gli Ala Gla Tbr Ala Gly Ala Art Lea Val Val Leu Ala 

115 120 125 



55 
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ACC GCT ACG CCT CCC GGA TCG GTC ACC GTG CCA CAC CCG AAT ATT GAG 
Thr Ala TIr Pro Pro Gl; Ser Val Thr Val Pro His Pro Asd lie Glu 
130 135 140 



431 



10 



IS 



20 



2S 



30 



35 



40 



GAG GTG GCC CTG TOT AAC ACT GGA GAG ATC CCC TTC TAT GGC AAA GGC 

Glo Val Ala Leo Ser Asd Tbr GIj GId lie Pro Phe Tjr Gly Ljs Glj 
145 150 155 

ATC CCC ATT GAA GTC ATC AAG GGG GGA AGG CAT CTC ATT TTC TGC CAT 

He Pro He Glo Yal He Lys Gly Gly Ar^ Bis Leo He Phe Cys His 
160 165 170 175 

TCC AAG AAG AAG TGC GAC GAG CTC GCC GCG AAG TTG TCA GGC CTC GGG 

Ser Lys Lys Lys Cys Asp Glo Leo Ala Ala Lys Leo Ser Gly Leo Gly 

180 185 190 

ATT AAT GCT GTG GCA TAC TAC CGG GGT CTT GAT CTG TCC GTC ATA CCG 

He Aso Ala Yal Ala Tyr Tyr Arg Gly Leo Asp Val Ser Yal He Pro 

195 200 205 

ACC AGC GGA GAC GTC GTT GTC GTG GCA ACA GAC GCT CTA ATG ACG GGC 

Thr Ser Gly Asp Val Yal Val Val Ala Thr Asp Ala Leu Met Thr Gly 
210 215 220 

TAT ACC GGC GAT TTT GAC TCA GTG ATC CAC TCT AAC ACA TCC GTC ACC 

Tyr Thr Gly Asp Pke Asp Ser Val He Asp Cys Asn TIr Cys Yal Thr 
225 230 235 

CAG ACA GTC GAC TTC AGC TTG GAC CCC ACC TTC ACC ATT GAG AC 

Gin Tbr Val Asp Phe Ser Leo Asp Pro Thr Phe Thr He Glo 
240 245 250 



479 



527 



575 



623 



671 



719 



763 



SEQ ID NO: 2 
<5 SEQUENCE LENGTH:615 base pairs 

SEQUENCE TYPE:oacleic acid 

STRANDEDNESSldoobKe- 

TOPOLOGY: linear 
so MOLECULE TYPE:cDNA to geoonic RNA 

C ACG CCC GGT TTG CCC GTG TGT CAA GAC CAC CTG GAC TTC TGC GAA GCG 49 
Thr Pro Gly Leo Pro Val Cys Glo Asp His Leo Glo Phe Trp Glo Ala 
» 1 5 10 15 
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10 



15 



30 



35 



40 



45 



CTC TTC ACA CGT CTC ACG CAC ATT GAT GCC CAC TTC CTC TCC CAG ACA 97 

Vai Phe Thr Gly Leu Tbr His ile Asp Al3 His Phe Leo Sei GId Tlr 

20 25 30 

AAG CAA GGA GGA GAC AAC TTC GCG TAT CTA ACG GCC TAC CAG GCC ACA H5 

Ljs GId Cly Glj Asp Asn Pke Ala Tyr Lea Tkr Ala Tjr GId Ala Tlr 

ii 40 45 

GTG TGC GCT AGG CCA AAC GCC CCT CCT CCC TCG TCG GAT GTC ATG TGG 193 

Vai Cys Ala Ar| Ala Ljs Ala Pro Pro Pro Ser Trp Asp Val Uet Trp 

50 55 60 



AAA TGT CTA GCT AGG CTG AAC CCT ACA CTA ATT GGT CCT ACC CCC CTC 241 
20 Ljs Cfs Leo Ala Arg Leo Ljs Pro Thi Lea lie Glj Pro Thr Pro Les 

65 70 75 



CTC TAC CCC TTG GGT GCC GTC ACC AAC CAG GTT ACC CTC ACG CAC CCC 289 
25 Leu Tyr An Leo CI; Ala Vai Tbr Asn Gin Val Tbr Leo Tkr Bis Pro 

g5 . 90 95 



GTG ACG AAA TAC ATC GCC ACG TGC ATG CAA CCT GAC CTC GAG ATC ATG 3J7 

Val Tkr L;s Tyr lie Ala Tkr C;s Mel Gin Ala Asp Leo Gin Ile Met 

100 105 110 

ACG AGC ACA TGG CTC CTA GCA CGG GGG GTC CTA GCC GCC GTG GCA GCT 385 

Tkr Ser Tkr Trp Val Leo Ala Gly GIr Vil Leo Ala Ala Val Ala Ala 

115 120- 125 

TAC TGC CTG GCA ACC GCC TGT GTT TCC ATC ATC 6GC C6C CTA CAC CTG 4J3 

Tyr Cys Lei Ala Tkr Gly Cys Yal Ser Ile Ile Gly An Let Bis Leo 
130 135 140 

AAT GAT CAA GTC GTT GTG ACT CCT CAC AAA CAA ATC TTA TAT GAG GCC 481 

Ato Asp Gil Val Val Val Tkr Pro Asp Lys Glo Ile Leo Tyr Gli Ala 
145 150 155 160 



TTT GAT GAG AT(r€AA CAA TGC GCC TCC AAA GCC CCC CTC ATT GAG CAA 529 
Pke Asp Glo Met Glo Glo Cys Ala Ser Lys Ala Ala Leo Ile Glo Glu ^ 
" 165 170 175 

GGG CAG CGG ATG GCG GAC ATG CTC AAG TCT AAC ATA CAA GCC CTC CTA 577 
Gly Gla Art Met Ala Glo Uet Leu Lys Ser Lys lie Glo Gly Leo Leu 
w 180 185 190 
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CAA CAG GCC ACA AGA CAG GCC CAA GAC ATA CAG CCA GC 
GId Gin All Thr Ar^ Gin Ala Glo Asp He Gla Pro 
I9S 200 



SEQ ID NO: 3 

SEQUENCE LENGTB:771 base pairs 
SEQUENCE TYPEloacleic acid 
STRANDEDNESS-.dooble 
topology: linear 

UOLECOLE TYPE:cDNA to genonic RNA 

GT GAG CGA GCC TCA GGA ATG TTT GAC ACT GTA GTC CTC TGT GAG TGC 
Glo ArK Ala Ser GIt Met Pke Asp Ser Val Val Leo Cys Glo Cjs 
IS 10 15 

TAT GAC GCA GGG GCT GCA TGG TAC GAG CTT ACA CCA GCG GAG ACC ACC 
Tyr Asp Ala Gly Ala Ala Trp Tyr Glo Leo Tkr Pro Ala Glo Tkr Thr 

20 25 30 

GTC AGG CTC AGA GCG TAT TTC AAC ACA CCT GGC TTG CCT CTG TGT CAA 
Val Are Leo An Ala Tyr Phe Asn Thr Pro Gly Leo Pro Val Cys Glo 

35 40 45 

GAC CAT CTT GAG TTC TGG GAC GCA GTT TTC ACC GGC CTC ACA CAC ATA 
Asp His Leo Glo Phe Trp Glo Ala Val Phe. Thr Gly Leu Thr Bis lie 
50 55 . 60 

GAT GCC CAC TTC CTT TCC CAG ACA AAG CAA GCA GGG GAC AAT TTC GCA 
Asp Ala Bis Phe Leo Str Gh Thr Lys Glo Ala Gly Asp Asa Phe Ala 
65 70 75 

TAC TTG ACA GCC TAC CAG GCT ACA GTC TGC GCC AGA GCC AAA GCC CCT 
Tyr Leo Thr Ala Tyr Gin Ala Thr Val Cys Ala Arg Ala Lys Ala Pro 

85 90 95 



CCC CCG TCC TGG GAC GTC ATG TGG AAG TGC CTG ACT CGG CTC AAG CCC 

Pro Pro Ser Trp Asp Val Met Trp Lys Cys Leo Thr Arg Leo Lys Pro 

100 105 110 

ACC CTT GTG GCC CCT ACA CCC CTT CTG TAC CGT TTA GGC TCT GTT ACT 

Thr Leo Val Ala Pro Thr Pro Leo Leo Tyr Arg Leu Gly Ser Val Thr 

lis 120 125 
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10 



15 



20 



25 



30 



35 



40 



45 



AAC GAG GTC ACC CTC ACA CAT CCT GTG ACG AAA TAG ATC GCC ACT TGC 431 

Asn Glu Yal Tfcr Leu Tbr His Pro Val Tlir Lfs Tjr lie Ala Tbr Cys 

130 1 35 NO 

ATG CAA GCT GAG CTT GAG GTC ATG ACC AGC ACG TGG GTC CTA GCT GGG 479 

yet GId Ala Asp Leu Glo Val Met Thr Ser Tbr Trp Val Lea Ala Gir 
145 ISO 155 

GGG GTC TTC GCA GCC GTC GCC GCG TAT TGC CTG GCG ACT GGG TGT GTC 527 

Glj Val Leu Ala Ala Val Ala Ala Tyr Cys Leu Ala Thr Giy Cys Val 
160 165 170 175 

TCC ATC ATC GGC CGC TTG CAC ATC AAT CAC CGA GCC GTC GTT GCA CCA 575 

Ser lie He Gly Art Leo His He Asa Glo Arg Ala Val Val Ala Pro 

180 185 190 

GAG AAG GAG GTC CTT TAT GAG GCT TTT GAT GAG ATG GAG GAG TGT GCC 623 

Asp lys Glu Val Lea Tyr Glu Ala Phe Asp Gh Uet Glu Gin Cys Ala 

195 200 205 

TCT AAA GCG GCT CTC ATT GAA GAG GGG CAG CGG ATA GCC GAG ATG CTG 671 

Ser Lys Ala Ala Leu He Glu Glu Gly Glu Arg He Ala Gh Met Leo 

210 215 220 

AAG TCC AAG ATC CAA GGC TTA TTG CAG CAA GCC TCT AAA CAG GCC CAG 719 

Lys Ser Lys He Gin Gly Leo Len Glo Glo. Ala Ser Lys Glo Ala Gin 
225 230 235 

CAC ATA CAA CCC GCT GTG CAG CCT CAT GGC CCA AGG TGG AGC AAT TCT 767 

Asp He Glo Pro Ala Val Glo Pro His Gly Pro Arg Trp Ser Aso Ser 
240 245 250 255 

GGG C 771 
Gly 



SEQ ID NO: 4 
so SEQUENCE LENGTH: 630 base pairs 

SEQUENCE TYPE:oocleic acid 

STRAND£DNESS:dooble 

TOPOLOGY: 11 near 
55 MOLECULE TYPE:(DNA to genonic RNA 
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C TGG TAT GAA CTT ACG CCT GCT GAG ACT ACC GTG AGA CTC COG GCC TAT 49 
Trp Tjr Glu Leu Ttr Pro Ala Gh Thr Thr Val Arg Leu Arg Ala Tyr 
15 ID 15 



10 



TTC AAC ACG CCC GGC CTG CCT GTG TGT CAA GAC CAC CTG GAA TTC TGG 
Phe AsD Thr Pro Gly Leo Pro Vat Cys Gin Asp Bis Lea Glu Phe Trp 

20 25 30 



97 



15 



GAG GCG GTC TTC ACA GGT CTC ACA CAC ATC GAT GCC CAC TTC CTC TCC 
Gin Ala Val Pbe Thr Gly Leu Thr His He Asp Ala His Phe Leu Ser 
35 40 45 



145 



20 



CAG ACG AAG CAA GGA GGA GAT AAC TTT GCA TAT TTA ACA GCC TAG CAG 
Gin Thr Lys Gin Gly Gly Asp Aso Phe Ala Tyr Leu Thr Ala Tyr Glo 
50 55 60 



193 



25 



30 



GCC ACA GTC TGC GCT AGG GCA AAG GCT CCC CCT CCT TCG TGG GAC GTG 
Ala Thr Val Cys Ala Arg Ala Lys Ala Pro Pro Pro Ser Trp Asp Val 
65 70 75 



ATG TGG AAG TGT TTG ATT AGG CTC AAA CCT ACA CTG ACT GGT CCT ACC 
Met Trp Lys Cys Leu He Arg Leu Lys Pro Thr Leu Thr Gly Fro Thr 

85 90 95 



241 



289 



35 



CCC CTC CTG TAG CGC TTG GGT GCC GTC ACC AAC GAC GTT ACC CTG ACT 
Pro Lea Leu Tyr Arg Lea Gly Ala Val Thr Asa Glo Val Thr Leo Thr 

100 105 110 



337 



40 



45 



BO 



55 



CAC CCC ATG ACG AAA TAT ATC GCC ACT TGT ATG CAA GCT GAT CTT GAG 

Bis Fro Met Thr Lys Tyr He Ala Thr Cys Uet Glo Ala Asp Leu Gla 

115 120 125 

ATC ATG ACA AGO ACA TGG GTC TTG GCG GGG GGG GTC CTA GCC GCT GTG 

He Met Thr Ser Thr Trp Val Leu Ala Gly Gly Val Lea Ala Ala Val 

130 135 140 

GCA GCT TAC TGC CTA GCG ACC GGC TGC ATT TCC ATC ATT GGC CGC CTT 

Ala Ala Tyr Cys Leu Ala Thr Gly Cys He Ser He He Gly Arg Lea 

145 ISO 155 160 

CAC CTG AAT GAT CCG GTC GTC GTG ACC CCT GAT AAG GAA ATT TTA TAT 

His Leu Aso Asp Arg Val Val Val Thr Pro Asp Lys Glu He Leu Tyr 

165 170 175 



3S5 



433 



4S1 



529 
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GAG GCC TTT GAT GAG ATG GAA GAG TGC GCC TOO AAA GCC GCC CTC ATT 
Glo Ala Pie Asp Glo Met Glu GId Cys Ah Ser Lys Ala Ala leu (le 

180 US 190 

GAG GAA GGG CAG CCG ATG GCC GAG ATG CTG AAG TOT AAA ATA CAA GGC 
Gla Glo Gl; GIb Arg Uet Ala Gin Met leu ly$ Sei Ljs lie Glo Gly 
195 200 205 

CTC TT 
leo 



S£Q ID no: 5 

SEQUENCE 1£NGTH:H26 base pairs 
SEQUENCE TYPEiiucleic acid 
STRANDEDNESSldoDble 
TOPOLOGY: linear 

UOLECUIE TYPE:cDNA to (enooic RNA 

GGG ATC AAC CCT AAC ATC AGG ACC GGA GTA CGG ACC GTG ACC ACC GGG 
Gly lie Aso Pro Aso He Arg Thr Gly Val Art Tkr Val Tbr Tbr Gly 
15 10 15 

GAC TCC ATC ACC TAC TCC ACT TAT GGC AAG TTT ATC GCA GAT GGA GGT 
Asp Ser He Tbr Tyr Ser Tbr Tyr Gly Lys Pbe He Ala Asp Gly Gly 

20 25 30 

TGC CCA CAT GGT GCC TAT GAC GTC ATC ATA TCC GAC GAA TGC CAT TCA 
Cys Ala Bis Gly Ala Tyr Asp Val He He Cys Asp Glo Cys Bis Ser 
35 40 45 

GTG GAC GCT ACT ACC ATC CTT GGC ATT GGA ACA GTC CTT GAC CAG GCT 
Val Asp Ala Tbr Tbr He Leo Gly He Gly Tbr Val Leo Asp Glo Ala 
SO 55 (0 

GAG ACC GCA GGT GCC AGG CTA GTG CTT TTA GCC ACA GCC ACG CCA CCC 
Glo Tbr Ala Gly iTTa Arg. Leo Val Val leo Ala Tbr Ala Tbr Pro Pro 
65 70 75 80 

GGT ACG GTA ACA ACT CCC CAC GCT AAC ATA GAG GAG GTG GCC CTT GGT 
Gly Tbr Val Tbr Tbr Pro His Ala Asn He Gin Glo Val Ala Leu Gly 

85 90 95 
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10 



15 



CAC GAA CGC GAG ATT CCT TTT TAT GGC AAG GCT ATT CCC CTA GCT TTC 

His Glu GIr GId He Pro Phe Ijt Gly lys Ala He Pro Leu Ala Phe 

100 lOS HO 

ATC AAG CGG CGC AGA CAC CTA ATT TTT TGC CAT TCA AAG AAG AAG TGC 

He Ljs Gly Gly Ar; Hit Leu He Phe Cjs Bis Str Ljs Ljs Ljs Cys 
IIS 120 I2S 

GAC GAG CTC GCA GCA GCC CTT CGG GGC ATG CGT ATC AAT GCC GTT GCC 

Asp Glu Leo Ala Ala Ala Leo Arg Gly Met Gly He Asn Ala Val Ala 

130 135 140 



336 



384 



432 



20 



TAC TAC AGG OCT CTC GAC GTC TCC GTT ATA CCA ACT CAA GGA CAC CTC 
Tyr Tyr Arg Gly Lea Asp Val Ser Val lie fro Tkr Gin Gly Asp Val 
145 150 155 160 



480 



25 



GTG GTT GTC GCC ACC GAT GCC CTA ATG ACT GGA TAC ACC CGT GAC TTT 
Val Val Val Ala Tkr Asp Ala Leu Met Tkr Gly Tyr Thr Gly Asp Pke 

165 170 175 



528 



30 



GAC TCT GTC ATC GAC TGC AAC GTT GCA GTC ACT CAC ATT GTT GAC TTT 
Asp Ser Val He Asp Cys Asa Val Ala Val Tkr Gin He Val Asp Pke 

180 185 190 



576 



35 



ACC CTA GAC CCA ACT TTT ACC ATC ACC ACT CAA ACC GTC CCT CAC GAG 
Ser Leo Asp Pro Tkr Pke Tkr He Tkr Tkr do Tkr Val Fro GIb GId 
195 200 205 



624 



40 



45 



GCT GTC TCC CGT ACT CAA CGT AGA GCC AGA ACT CGG ACC GGC CCA CTC 

Ala Val Ser Art Ser GId Ar( Arg Gly Arg Tkr Gly Arg Gly Arg Lea 

210 215 220 

GGC ACT TAC ACC TAT CTC TCC TCA GGC GAG AGG CCG TCT GCC ATG TTC 

Gly Thr Tyr Arg Tyr Val Ser Ser Gly Glo Arg Pro Ser Gly Met Phe 

225 230 23S 240 



672 



720 



60 



59 



GAC ACC CTA CTA CTC TGC GAG TCC TAT GAT GCC CGG CCA CCC TGC TAC 

Asp Ser Val Val Leo Cys Gli Cys Tyr Asp Ala Gly Ala Ala Trp Tyr 

245 250 255 

GAG CTT ACA CCT GCT GAG ACC ACA GTG AGA CTC CGG GCT TAT TTC AAC 

Glu Leo Thr Pro Ala Glo Thr Thr Val Arg Leo Arg Ala Tyr Phe Asa 

260 265 270 



768 



816 



24 
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ACG CCC CGT TTG CCC GTG TGT CAA CAC CAC CTG GAG TTC TGC GAA GCG 
Tlir Pro Gl; Leu Pro Vsl Cn Glo Asp His Lto Glo Pke Trp Glo Ala 
2?S 280 2t5 



10 



CTC TTC ACA GCT CTC ACG CAC ATT GAT CCC CAC TTC CTC TCC CAG ACA 
Val Phe Ttr Gly Leu Thr Bis lie Asp Ala His Pke Leo Ser GIs Tkr 
290 295 300 



912 



15 



20 



AAG CAA CCA GGA CAC AAC TTC GCG TAT CTA ACG CCC TAC CAC GCC ACA 

Its Glo Gh GIt Asp Aso Pbe Ala Tjr Leo Tkr Ala Trr Glo Ala Tbr 

305 310 315 320 

CTG TGC GCT ACG CCA AAG GCC CCT CCT CCC TCG TGC GAT GTG ATG TGG 

Val Cjs Ala Arg Ala Lji Ala Pro Pro Pro Ser Trp Asp Val Uet Trp 

325 330 335 



960 



1008 



25 



AAA TGT CTA GCT ACG CTC AAG CCT ACA CTA ATT GCT CCT ACC CCC CTC 
Lys C;s Itu Ala Arg Leo Lys Pro Thr Leo lie Cly Pro Tkr Pro Leo 

340 345 350 



1056 



30 



35 



40 



CTC TAC CGC TTG GCT GCC CTG ACC AAC GAG CTT ACC CTG ACG CAC CCC 

leu Tyr Arg Leo Gly Ala Val Thr Aso Glo Tal Thr Leo Thr His Pro 
355 360 365 

GTG ACG AAA TAC ATC GCC ACG TGC ATG CAA GTG AAC CTC GAG ATC ATG 

Val Thr Lys Tyr lie Ala Thr Cys Met Glo Val Asi Leo Glo He Uet 

370 375 380 

ACG AGO ACA TGG 6TC CTA CCA G6G GGG GTG CTA GCC GCC GTG GCA GCT 

Thr Ser Thr Trp Val Leo Ala Gly Gly Val Leo Ala Ala Vi! Ala Ala 
385 390 395 400 



1104 



1152 



1200 



45 



TAC TGC CTC GCA ACC GCC TGT CTT TCC ATC ATC CGC CCC CTA CAC CTG 
Tyr Cyt Leo Ala Thr Gly Cys Val Ser He He Cly Arg Leo His Leo 

405 410 4IS 



1248 



50 



AAT GAT CAA CTG GtT GTG ACT CCT CAC AAA CAA ATC TTA TAT CAC CCC 
Asi Asp Glo Val Val Val Thr Pro Asp lys Glo He Leo Tyr Glo Ala 

420 425 430 



1296 



55 



TTT GAT GAG ATG GAA GAA TGC GCC TCC AAA GCC GCC CTC ATT CAG CAA 
Phe Asp do Met Glo Glo Cys Ala Ser Lys Ala Ala Leo He Glo Glo 
435 440 445 



1344 



25 



EP0468 657 A2 



GGG CAG CGG ATG CCG GAG ATG CTC AAG TCT AAG ATA CAA CGC CTC CTA 
GI7 GId ktg Met Ala Glo Met Leo Lys Ser Ljs Me GId GIj Leo Leo 
450 455 460 



1392 



10 



CAA CAG GCC ACA AGA CAG GCC CAA GAC ATA CAG C 
Glo Glo Ala Tbr Arg Glo Ala Glo Asp Tie Glo 
465 470 475 



1426 



„ SEQ ID no: 6 

SEQUENCE LENGTH: 855 base pairs 

SEQUENCE TYPE:oucleic acid 

STRANDEDNESSldouble 
20 TOPOLOGY: 11 Dear 

MOLECULE TYPE:cDKA to (eooaic RNA 

CG CAG ACA TTC CAA GTG GCC CAT CTC CAC CCT CCC ACT GCT AGC GGC 47 
25 Glo Tbr Pkc Glo Val Ala His Leo His Ala Pro Thr Gh Ser GI7 
15 10 15 

AAG AGC ACT AAG GTG CCG GCT GCA TAT GCG GCC CAA GGG TAC AAG GTA 95 
30 Lys Ser Thr Lys Val Pro Ala Ala Tyr Ala Ala Glo Gly Tyr Lys Val 

20 25 30 

CTC GTC CTG AAC CCC TCC GTT GCC GCC ACT TTA GCC TTT GGG CCC TAC 143 
^ Leu Val Leo Aso Pro Ser Val Ala Ala Thr Leo Ala Phe Gly Ala Tyr 

35 40 45 

ATG TCT AAG GCA CAT GOT GTC GAC CCT AAC ATC AGA ACT GGG GTG AGG 191 
*> Uet Ser Lys Ala His Gly Val Asp Pro Aso lie Arg Tkr Gly Val Arg 

50 55 60 



45 



SO 



55 



ACC ATC ACC ACG GGC CCT CCC ATC ACC TAC TCC ACC TAT CGT AAG TTC 239 

Tkr lie Thr Thr Gly Ala Pro He Thr Tyr Ser Thr Tyr Gly Lys Phe 

65 70 7S 

CTT GCC GAC GGT GGT TGC TCT GGG GCC GCC TAT GAC ATC ATA ATA TGT 287 

Leo Ala Asp Gly Gly Cys Ser Gly Gly Ala Tyr Asp lie He He Cys 

80 85 90 95 

GAT GAC TGC CAC TCA ACT GAC TCG ACA TCC ATC TTC GGC ATC GGC ACA 335 

Asp Glo Cys His Ser Tbr Asp Ser Tbr Ser He Leu Gly He Gly Tbr 

100 105 lit 



26 
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GTC CTG GAC CAA CCG GAG ACG GCT GCA GCG CCG CTC GTC GTG CTC GCT 
Val Leo Asp Glo All GId Tbr Ala Gl; Ala Arg Lea Val Val Lea Ala 
s 115 120 12S 



383 



10 



15 



ACC GCT ACG CCT CCG GCA TCG GTC ACC GTG CCA CAT CCC AAT ATC GAG 
Thr Ala Tkr Pro Pro Glj Ser Tai Thr Val Pro His Pro Aso He Glu 
130 135 HO 

GAG GTG GCC CTG TCC ACC ACT GCA GAG ATT CCC TTC TAC GGC AAA CCT 
GIq Val Ala Lea Ser Thr Thr Gl; Glo lie Pro Phe Tjr GI7 Lji Ala 
H5 150 155 



431 



479 



20 



ATC CCC ATC GAG ACA ATC AAG GGG GGC AGG CAT CTC ATC TTC TGC CGT 
lie Pro He GId Thr He Lys Q\j GIt Arg His Leu He Phe Cys Arg 
160 165 170 175 



527 



25 



TCC AAG AAG AAG TGT GAC GAG CTC GCT GGA AAG CTG TCA GCC CTC GGA 
Ser Lys Ly$ Lys Cys Asp Glo Lea Ala Gly Lys Leu Ser Ala Leo Gly 

180 185 190 



575 



30 



ATC AAC GCT GTA GCG TAC TAC CGG GGT CTT GAT GTA TCC GTC ATA CCG 
He Asa Ala Yal Ala Tyr Tyr Arg Gly Leo Asp Val Ser Val He Pro 

195 200 205 



623 



35 



ACC AGC GGA GAC GTC GTT GTC GTG GCA ACA GAC GCT CTA ATG ACG GGC 
Thr Ser Gly Asp Val Val Val Yal Ala Thr. Asp Ala Leo Met Thr Gly 
210 215 220 



671 



40 



45 



TAC ACC GGT GAC TTT GAT TCA GTC ATC GAC TGC AAT ACA TGT GTC ACC 

Tyr Tkr Gly Asp Phe Asp Ser Val He Asp Cys Aso Thr Cys Val Thr 

225 230 235 

CAG ACA GTC GAC TTC AGC TTG GAC CCT ACC TTC ACC ATT GAG ACG ACG 

Glo Thr Val Asp Phe Ser Leo Asp Pro Tbr Phe Tbr He Gla Tkr Thr 

240 245 250 255 



719 



767 



so 



ACC GTG CCT CAA GAC GCG GTG TCA CGC TCG CAG CGG CGA GGC AGA ACT 
Thr Val Pro Glo Asp Ala Val Ser Arg Ser Gin Arg Arg Gly Arg Tkr 

260 265 270 



815 



GGT AGG GGT AGA GGG GGC ATA TAC AGC TTT GTG ACT CCA G 
55 Gly Arg Gly Arg Gly Gly He Tyr Arg Phe Val Tbr Pro 

275 280 



855 
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SEQ ID NO: 7 

SEQUENCE LENGTH:31S base pairs 
SEQUENCE TYPEioucieic acid 
STRANDEDNESSidouble 
TOPOLOGT: linear 

MOLECOLE TYPErcDNA to genonic RNA 

GAG GAG CTC GCC GCA AAG CTG TCA GGC CTC GCA GTC AAT OCT GTG GCA 

Asp GId Lea Ala Ala Ljs Leu Ser Qlj Leo Glf Val Aso Ala Val Ala 

15 10 IS 

TAC TAC CGG GGT CTC GAT GTG TCT GTC ATA CCC ACC AGC GGG GAG GTC 

Trr Tjr Ar{ Gl; Leu Asp Val Ser Val lie Pro Thr Ser Glj Asp Val 

20 25 30 

GTT GTT GTG GCA ACA GAC GCT CTA ATG ACG GGC TAT ACC GGC GAC TTT 

Val Val Val Ala Tlr Asp Ala Leo Met Thr Glj Tjr Tlir Glj Asp Phe 

35 40 45 

GAC TCG GTG ATC GAC TGC AAT ACA TGT GTC ACC CAA ACA GTC GAT TTC 

Asp Ser Val He Asp Cji Aso Thr C;s Va! Thr Glo Thr Val Asp Phe 

50 55 60 

AGC TTG CAC CCT ACT TTC ACC ATT GAG ACG ACG ACC GTG CCC CAA GAC 

Ser Leu Asp Pro Thr Phe Thr He Glu Thr. Thr Thr Val Pro Glo Asp 

65 70 75 SO 

GCG GTG TCG GGC TCG CAG CGG CGA GGC AGG ACT GGT AGG GGC AGG GTG 

Ala Val Ser Arc Ser Glo Arg Art Gl; An Thr GIr Arg GIr Arg Val 

S5 90 95 

GGC ATA TAC AGG TTT GTC ACT CCC GAG 

Gil lie lit Arg Phe Val Thr Pro Glo 

100 105 
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SEQ ID NO: 8 

SEQUENCE LENGTH:91I base pairs 
SEQUENCE TYPElDucleic acid 
STRANDEDNESSldouble 
TOPOLOGY: linear 

MOLECULE TYPE:cDNA to geoomic RNA 

GT GAT GAG CTC GCC GCA AAG CTC TCA ACC CTC GGA CTC AAC GCT GTA 
Asp Clu Leu Ala Ala Ljs Leo Ser Ser Leu Giy Leu Asn Ala Val 
IS 10 IS 

GCA TAT TAC CGG GCT CTT GAT CTG TCC GTC ATA CCG ACT ACT GGA GAC 
Ala Tyr Tyr Arg Glf Leu Asp Val Ser Val He Pro Thr Ser Gly Asp 

20 25 30 

GTC GTT GTC GTG GCA ACA GAC GCT CTA ATG ACG GGC TAT ACC GGC GAC 
Val Val Val Val Ala Tkr Asp Ala len Met Thr Gly Tyr Thr Gly Asp 

35 40 45 

TTT GAC TCA GTG ATC GAC TGT AAC ACA TCT GTC ACC CAG ACA GTT GAT 
Phe Asp Ser Val lie Asp Cys Asd Thr Cys Val Thr Glo Thr Val Asp 

50 55 60 

t 

TTC AGC TTG GAT CCA ACC TTC ACC ATT GAG ACG ACG ACC GTG CCT CAA 

Phe Ser Leu Asp Fro Thr Phe Thr He Gin Thr Thr Thr Val Pro Glo 
65 70 75 

GAC GCG GTG TCG CGC TCG CAG CGG CGA GOT AGG ACT GGC AGG GGC AGG 
Asp Ala Val Ser Arg Ser Glo Arg Arg Gly Arg Thr Gly Arg Gly Arg 
80 85 90 95 

GGC GGC ATC TAT AGG TTT GTG ACT CCA GGA GAA CGG CCC TCG GGC ATG 
Gly Gly He Tyr Arg Phe Val Thr Pro Gly Glo Arg Pro Ser Gly Met 

100 105 HO 

TTC GAT TCC TCG GTC CTG TGT GAG TGT TAT GAC GCG GGC TGT CCT TGG 
Phe Asp Ser Ser Vrl-Leu Cys Glo Cys Tyr Asp Ala Gly Cys Ala Trp 

115 120 125 

TAT GAC CTC ACG CCC GCC GAC ACC ACG GTT AGG TTG CGG GCT TAC CTA 
Tyr Glo Leu Thr Pro Ala Glo Thr Thr Val Arg Len Arg Ala Tyr Leu 
130 135 140 
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10 



IS 



20 



25 



30 



35 



40 



45 



50 



AAT ACA CCA GGG TTG CCC GTC TGC CAG GAC CAT CTG GAG TTC TGG GAG 
Ajd Thr Pro Gly Leu Pro Val Cjs Gin Asp Bis Leu Gin Fie Trp Glu 
H5 150 155 

GGC GTC TTC ACA GGC CTC ACC CAC ATA GAT GCC CAT TTC TTC TCT CAG 
Gly Val Phe Tkr Cly Leu Tkr Bis lie Asp Ala Bis Phe Leu Ser GId 

1(0 165 no 

ACT AAG CAG GCA GGA CAC AAC TTT CCC TAC CTG GTC GCA TAC CAA GCT 
Thr Lys Gin Ala Gly His Asn Phe Pro Tyr Lea Val Ala Tyr Ch Ala 

180 185 190 

ACA GTC TGC GCC AGG GCT CAG GCT CCA CCT CCA TCC TGG GAC CAA ATG 
Tlr Val Cys Ala Arg Ala GId Ala Pro Pro Pro Ser Trp Asp Gli Met 

19S 200 205 

TGG AAG TGT CTC ATA CGG CTG AAA CCT ACG CTG CAC GGG CCA ACA CCC 
Trp Lys Cys Len He kit Leu Lys Pro Tkr Leu Bis Gly Pro Tlr Pro 
210 215 220 

CTG CTC TAT ACG CTA CCA GCC GTC CAA AAT CAG CTC ACC CTC ACA CAC 
Len Leu Tyr Arg Leu Cly Ala Val Glu Asn Glu Val Tlr Leu Thr His 
225 230 235 

CCC ATA ACC AAA TTC ATC-ATG GCA TGC ATG. TCC GCT GAT CTG CAC GTC 
Pro He Thr lys Pie He Mel Ala Cys Met. Ser Ala Asp Len Glu Val 
240 245 250 255 

GTC ACC AGC ACC TGG CTG CTG GTG GGC GGA GTC CTT GCA GCT CTC GCC 
Val Thr Ser Thr Trp Val Len Val Gly Gly Val Len Ala Ala Leu Ala 

260 265 270 

GCA TAT CGC CTG ACA ACA GGC AGC CTG GTC ATC GTG GCT ACG ATC ATC 
Ala Tyr Art Lea Thr Thr Cly Ser Val Val Me Val Cly Arg He He 

275 280 285 

TTG TCT GGG AGG CCC CCT CTC ATT CCC GAC AGG CAA GTC CTT TAC CGG 
Leo Ser Gly Arg Pre Ala Val He Pro Asp Arg Gin Val Len Tyr Arg 
290 295 300 



479 



527 



575 



623 



671 



719 



767 



815 



863 



911 



55 



30 
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SEQ ID N0:9 

SEQUENCE LENGTH:4g9 base pairs 
SEQUENCE TYPEiDDcleic acid 
STRANDEDNESSldooble 
TOPOLOGY: I i Dear 

MOLECULE TYPElcDNA to gCDOisic RNA 



CG ACA ACC GTG CCC CAA GAC GCG GTG TCC CGC TCA CAA CGG CGG GGT 
Tbr Tbr Val Pro Glo Asp Ala Vai Ser Arj Ser Gin Ar{ Arg Gij 
15 10 15 

AGG ACA GGT ACG GGC AGG AGA GGC ATC TAC AGA TTT GTG ACT CCG GGA 
Arg Tbr Glj Arg Cly Arg Arg Glj lie Tjr Arg Pke Val Tbr Pro Glj 

20 25 30 

GAA CGG CCC TCG GGC ATG TTC GAT TCT TOG GTC CTG TGT GAG TGC TAT 
GIq Arg Pro Ser Glj Met Pbe Asp Ser Ser Val Leu Cjs Glo C;s Tjt 

35 40 45 

GAC CCG GGC TGC GCT TGG ATC GAG CTC ACG CCC GCC GAG ACC TCA GTT 
Asp Ala Gly Crs Ala Trp He Gli Leo Tbr Pro Ala Glo Tbr Ser Val 
50 55 60 

AGG TTG CGC GCT TAC CTA AAT ACA CCA GGG TTG CCC GTC TGC CAG GAC 
Arg Leo Arg Ala Tjr Leo Asd Tbr Pro Glj Leu Pro Val Cjs Gin Asp 
65 70 75 

CAC CTG GAA TTC TGG GAG AGC GTC TTC ACA GGC CTC ACC CAT ATA GAT 
His Leu Glo Pke Trp Gli Ser Val Pbe Tbr Glj Leu Tbr His lie Asp 

85 90 95 



GCC CAC TTC TTG TCC CAG ACC AAG CAG GCA GGA GAC AAC TTC CCC TAC 

Ala His Pbe Leo Ser Gin Tbr Ljs Gin Ala Glj Asp Aso Pbe Pro Tjr 

100 105 110 

CTG CTA GCA TAC CAA GCT ACA GTG TGC GCC AGG GCC CAG GCT CCA CCA 

Lea Val Ala Tjr Gtn Ala Tbr Val Cjs Ala Arg Ala Glo Ala Pro Pro 

lis 120 125 



CCA TCG TCC CAT CAA ATC TGG AAG TGT CTC ATA CCG CTG AAA CCT ACC 
Pro Ser Trp Asp Gin Met Trp Lys Cys Leu lie ArgLeu Lys Pro Thr 
130 135 140 
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CTA CAC GGC CCA ACA CCC CTG TTG TAT AGG CTG GGA GCC GTC CAA AAT 

Lea His Gly Pro Tlr Pro Lea Leu Tyr Arg Leu Glj k\i Vjl Gin Asa 
HS ISO 155 

GAG GTC ACC C 

Gla Val Thr 
160 



SEQ ID NO:iO 

SEQDEHCE LENGTfl:l076 base pairs 
SEQOEHCE TYPElDocleic acid 
STRANDEDNESSidoulle 
TOFOLOGY:lioear 

MOLECULE TYPElcDNA to {eoonlc ENA 

GT GGT GTC CTG GGT GCC ATC GIG GTC AGC CTA ACG GGC CGC GAC AAG 
Gir Leu Leo Gly Aia lie Vai Val Ser Leo Tbr Glj kit Asp Ljs 
15 10 15 

AAC CAG GTC GAC GGG GAG GTT CAG GTG GTC TCC ACC GCA ACC CAA TCT 
AsD GId Val Glu Gl; Gh Val Gin Val Val Ser Tkr Ala Thr GIb Ser 

20 25 30 

TTC CTG GCG ACC TGC GTC AAT GGC GTG TGT TCG ACC GTC TAC CAT GGC 
Fbe Leo Ala Tkr Cjs Val Asn Gly Val Cjs Trp Tkr Val Tjr His Gly 

35 40 45 

GCC GGC TCG AAA ACC CTG GCC GGC CCG AAG GGT CCA GTC ACC CAA ATG 
Ala Gly Ser Lys Tkr Leo Ala Gly Pro Lys Gly Fro Val Tkr Gli Uet 
50 55 60 

TAC ACT AAT GTC GAC CAG GAC CTC GTC CCC TGG CCC GCG CCC TCC GGG 
Tyr Tkr Asn Val Aip Glo Asp Leo Val Gly Trp Pro Ala Pro Ser Gly 
65 70 75 

GCG CGG TCC TTG ACTCCA TGC ACC TCC GGC AGC TCG GAC CTT TAC TTG 
Ala Arg Ser Leo Tkr Pro Cys Tkr Cys Gly Ser Ser As» Lea Tyr Leo 
80 gS 90 95 

GTC ACG AGG CAT GCT GAT GTC ATT CCG CTG CGC CGG CGG GGC GAT AGC 
Val Tkr Arg His Ala Asp Val He Pro Vil Arg Arg Arg Gly Asp Ser 

1 00 105 no 
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10 



15 



20 



25 



SO 



ACG CGG AGC CTG CTT TCC CCC AGG CCC CTC TCC TAG TTG AAG GGC TCC 383 

Art Gly Set Leu Leo Ser Fro Arj Pro Leu Ser Tyr Leo Lys Gly Ser 

lis 120 iZS 

TCA GGT GGT CCA CTC CTT TGC CCC TCG GGG CAC ATT GTG GGC ATC TTC 431 

Ser Gly Gly Pro Leu Leu Cys Pro Ser Gly Bis He Yal Gly He Phe 
no 135 MO 

CGG GCT GCC CTG TGC ACC CGG GGG GTT GCG AAG GCG GTG GAC TTT GTA 4T9 

An Ala Ala Val Cys Thr Arg Gly Val Ala Lys Ala Yal Asp Phe Val 
Hi ISO ISS 

CCT CTC GAG TCT ATG GAA ACT ACT ATG CGG TCT CCG GTC TTC ACG GAT S27 

Pro Yal Glu Ser Mel Clu Thr Thr Met Arj Ser Pro Yal Pie Thr Asp 

1(0 16S 170 ITS 

AAT TCA TCC CCC CCG GCC GTA CCG CAG AC A TTC CAA GTC GCC CAT CTC 57 5 

Asn Ser Ser Pro Pro Ala Yal Pro Gin Thr Phe Glo Yal Ala Bis Leo 

180 185 190 



CAT GCC CCC ACT GGC AGC GGC AAG AGC ACT AAG CTG CCG GCT CCA TAC 623 

Bis Ala Pro Thr Gly Ser Gly lys Ser Thr Lys Val Pro Ala Ala Tyr 
" 195 200 20S 

GCA GCC CAG GGA TAC AAG GTA CTC GTC CTG AAC CCG TCC GTT GCC GCC 671 

Ala Ala Glo Gly Tyr Lys Yal Leo Yal Len Asd Pro Ser Yal Ala Ala 

^ 210 215 220 

ACC TTA GCT TTT GGA GCA TAT ATG TCC AAG CCA CAT GGT GTC GAC CCT 719 

Thr lea Gly Phe Gly Ala Tyr Met Ser Lyt Ala His Gly Yal Asp Pro 
« 225 230 235 

AAC ATC AGC ACT GGG GTA ACC ACC ATC ACT ACG GGC GCC CCC ATT ACA 767 

Asa Me Arg TIr Gly Yal Arg Thr He Thr Thr Gly Ala Pro lie Thr 
45 240 24S 2S0 255 

TAC TCC ACC TAT^C AAG TTT CTT CCC GAC CCT GCT TGC TCC GGG GGC 815 

Tyr Ser Thr Tyr Gly Lys Phe Leo Ala Asp Gly Gly Cys Ser Gly Gly 

2(0 2(5 270 



$5 



GCC TAT CAC ATC ATA ATA TGT GAT GAG TGC CAC TCA ACT GAC TCG ACT 863 
Ala Tyr Asp He He He Cys Asp Glu Cys Bis Ser Thr Asp Ser Thr 

275 280 285 
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TCC ATT TTC CGC ATT GGC ACQ GTC CTG GAC CAA CCG GAG ACG GCT CGA 911 

Ser He Leu Gir He Gl; Tlir 1»\ Leu Asp Glo Ala Glu Tbr kit Gly 
« 290 295 300 

GCC CGG CTC GTC CTG GTC GGC ACC COT ACG OCT CCA GGA TCG GTC ACT 959 

Ala Ari Leo Val Val Leu Ala Thr Ala Thr Fro Pro Glf Ser Yal Tkr 
'o 305 3 10 3 1 5 

CTG CCT CAT CCC AAC ATC GAG CAG GTC GCC TTG TCC ACC ACT CGA GAC 1007 

yal Pro Bis Pro Asd He Gla Glo Val Ala Leo Ser Ser Thr CI; Glo 
« 320 325 330 335 

ATT CCC TTC TAT CGC AAA GCC ATC CCC ATT GAG ACC ATC AAC CGG CCA 1055 

lie Pro Pke Tyr GI7 Ljs Ala He Pro He Glo Tkr He Ljs Glj GIr 
» 340 345 350 

AGG CAT CTC ATT TTC TGC CAC • 1076 

kti Bis Len He Pke Cjs Bis 

" 355 
Sequeoce LIsHde 



SEQ ID ho: 11 

SEQUENCE LENGTB:284 base pairs 
SEQUENCE type: BBC leic acid 
STRANDEDKESS:dooble 
TOPOIOGT: linear 

MOLECULE TYPE:cDNA to geoonic RXA 

GTC GAC CCC AAT ATT AGA ACT CGG GTA ACC ACC ATC ACC ACC GCC GCT 48 
Val Asp Pro Asi He Arg Tkr CI; Val Ar{ Tkr He Tkr Tkr CI; Ala 
15 10 15 

CCC ATT ACG TAT TCT ACC TAT CGC AAA TTC CTT GCC CAC CCT GCT TCC 96 
Pro He Tkr Tyr Ser Tkr Tyr Gly Lys Pke Leo Ala Asp Clj Gly Cys 

20 25 30 

TCT CGG GCC GCC TAT GAC ATC ATA ATC TCT CAT GAG TGC CAC TCA ACT , 144 
Ser Gly Gly Ala Tyr Atp He He He Cys Asp Glo Cys His Ser Tkr 
35 40 45 

GAC TCG ACT TCC ATC TTG GGT ATC CGC ACA CCC CTG GAC CAA CCG GAG 192 
Asp Ser Tkr Ser He Leu Gly He Gly Thr Ala Leu Asp Glo Ala Glo 
50 55 60 
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ACG GCT GGA GCA CGG CTT GTC GTG CTC GCC ACC CCT ACG CCT CCA GGC 
Thr Ala Gly Ala Arg Leu Val ¥al Lea Ala Thr Ala Hr Pro Pro Gly 
6S 70 75 



240 



10 



TOG CTC ACC GTG CCG CAT CCC AAC ATC GAG GAG GTA GCC TTG CC 
Ser Val Tkr Val Pro Bis Pro Asd lie Gin Gin Val Ala Lea 

85 90 



2U 



SEQ ID NO: 12 

SEQUENCE LENGTH: 641 base pairs 
SEQUENCE TYPE:aucleic acid 
STRANDEDNESS:joable 
topology: linear 
^ MOLECULE TYPE:cDNA to (caoBic RNA 

G GAC AAC TCA TCT CCC CCG GCG GTA CCG CAG ACA TTC CAC GTC CCC CAT 49 
Asp Aso Ser Ser Pro Pro Ala Val Pro Glo Tkr Phe Glo Val Ala His 
^ 1 5 10 15 

CTA CAC CCT CCC ACT GGC ACC GGC AAC ACC ACT AAC GTG CCC CCT GCA 97 
Leo His Ala Pro Thr Gly Ser Cly Lys Ser Thr Lys Val Pro Ala Ala 

20 2S 30 

TAT CCA GCC CAA GGG TAC AAA GTA CTC GTC CTC AAC CCG TCC CTT GCC 145 
Tyr Ala Ala Gla Gly Tyr Lys Val Leo Val. Lea Ass Pro Ser Val Ala 
^ 35 40 45 

GCC ACC TTA ACT TTC GGG GCC TAT ATG TCC AAC GCA CAT GCT CTT CAC 193 
Ala Tkr Lea Ser Phe Gly Ala Tyr Met Ser Lys Ala Bis Gly Val Asp 
* 50 S5 60 

CCT AAT ATC A6A ACT GGC ACA ACG ACC ATC ACC ACG GGC GCT CCC ATC 241 
Pro Asi lie Arg Tkr Gly Tkr Ar( Tkr He Thr Tkr Gly Ala Pro lie 
65 70 75 80 

ACG TAC TCC ACC TAT^GGC AAG TTC CTT GCA GAC 6GT CCT TGC TCC GGA 289 
Tkr Tyr Ser Tkr Tyr Gly Lys Pke Lea Ala Asp Gly Gly Cys Ser Cly 
" 85 90 95 

GCC GCC TAT GAC ATC ATA ATA TGC CAT CAC TCC CAC TCA ACA CAC TCC 337 
Gly Ala Tyr Asp lie He lie Cys Asp GId Cjs His Ser Tkr Asp Ser 
^ 100 105 HO 
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ACT TCC ATC TTA GGC ATT GGT ACG GTC CTG CAC CAA GCG GAG ACG GCT 

Tbr Ser He Leo Glj He Glj Tbr Val Leu Asp Gin Ala Glo Thr Ala 

115 120 125 

CGA GCG CGA CTC GTC GTG CTC GCC ACC GCT ACG CCT CCA GGA TCC GTC 

Gij Ala Arg Leu Val Val Leo Ala Thr Ala Thr Pro Pro Gly Ser Val 

130 135 140 

ACT GTC CCA CAT CCC AAC ATC GAG GAG GTG GCC CTG TCC AAC ACT GGA 

Thr Val Pro Bis Pro Aso lie Gin Glo Val Ala Leu Ser Aso Thr Glj 

145 150 155 160 

GAG ATT CCC TTC TAT GGC AAA GCC ATC CCC ATT GAG GCC ATC AAG GGG 

GId lie Pro Phe Tjr Glj Lfs Ala lie Pro He Glu Ala He Its Gly 

165 170 175 

GGG AGG CAT CTC ATT TTC TGC CAT TCT AAG AAG AAG TGT GAT GAG CTC 

Gly Ktg His Leu He Phe Cys His Ser Lys Lys Lys Cys Asp Glo Leu 

180 185 190 

GCC ACG AAG CTG TCG CCC CTC GGA CTC AAT CCT GTA GCG TAC TAC CGG 

Ala Thr Lys Leu Ser Ala Leo Gly Leu Asa Ala Val Ala Tyr Tyr Arg 

195 200 205 

GCT CTT GAT GTG TCC G 
Gly Leu Asp Val Ser 
210 



SEQ ID N0:i3 

SEQUENCE LENGTH:432 base pairs 
SEQUENCE TYPElaocieic acid 
STRANDEDNESS: double 
TOPOLOGY: 11 Dear 

llOLECOLE TTP£:cDNA to geoonic RNA 

CA GGC GAG AGG CCCltCA GCC ATG TTT CAC ACC GTA GTC CTC TGT GAG 
Gly Glo Arg Pro Tbr Gly Uet Phe Asp Ser Vai Val Leo Cys Glu 
IS 10 15 

TGC TAT GAT CCC GGG GCC GCC TGG TAC GAG CTT ACG CCT CCT GAG ACT 
Cys Tyr Asp Ah Gly Ala Ala Trp Tyr Glo Lea Tbr Pro Ala Glu Tbr 

20 25 30 
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ACG GTG AGA CTC CGG GCT TAT TTC AAC ACG CCC CGT TTG CCT GTA TOT 
Tbr Val Arg Leu Arg Ah Tyr Phe Asd Thr Pro Gly Leo Pro Val Cjs 

35 <0 45 



143 



10 



CAA GAC CAC CTA GAG TTC TGG CAA GCG GTC TTC ACA GGT CTC ACA CAC 
Glo Asp (iis Leo Giu Phe Trp Gla Ala Val Pbe Thr Qlj Leu Thr Bis 
50 55 60 



191 



15 



20 



25 



30 



ATT GAT GCC CAC TTC CTC TCC CAG ACG AAG CAA GGA GGA GAC AAC TTT 

lie Asp Ala His Phe Leo Ser Gin Thr Lys Gin Gly Gly Asp Aso Phe 

65 ' 70 75 

GCG TAT CTA ACG GCC TAC CAG GCC ACA GTA TGC CCC AGO GCA AAG GCC 

Ala Tyr Leo Thr Ala Tyr Glo Ala Thr Val Cys Ala Art Ma Lys Ala 

80 85 90 95 

CCC CCT CCT TCG TGG CAC GTG ATG TGG AAG TGT CTA ATC AGG CTC AAA 

Pro Pro Pro Ser Trp Asp Val Met Trp Lys Cys Leu lie Arg Leo Lys 

100 105 110 

CCT ACA TTG ACT GGT CCT ACC CCC CTC CTG TAC CGC TTG GGT GCC GTG 

Pro Thr Leu Thr Gly Pro Thr Pro Leo Leu Tyr Arg Leo Gly Ala Val 

115 120 125 



239 



287 



335 



383 



35 



ACT AAC GAG GTT ACC CTG ACC CAC CCC GTG ACG AAA TAT ATC GCC ACG T 
Thr Asn Glu Val Thr Leo Thr His Pro Val. Thr Lys Tyr lie Ala Tbr 
130 135 140 



432 



* SEQ ID HO: 14 

SEQUENCE LENGTB:369 base pairs 
SEQUENCE type: nucleic acid 
STRANDEDNESS: double 

* TOPOLOGY: linear 

UOLECULE TYPE:cDNA to genonic RNA 

^ ATG GGC ACG AAT CCT AAA CCT CAA ACA AAA ACC AAA AGA AAC ACT AAC . 48 
Met Gly Thr Asn Pro Lys Pro Gin Arg Lys Tbr Lys Arg Aso Thr Asn 
1 5 10 IS 

„ CGT CGC CCA CAA GAC GTT AAG TTT CCG GGC GGC GGC CAG ATC GTT GGC 96 
Arg Arg Pro Glo Asp Val Lys Phe Pro Gly Gly Gly Gin Me Val Gly 

20 25 30 
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GCA CTA TAC TTG TTG CCG CGC AGG GGC COG AGA TTG GGT GTC CGC GOG 
Cly Val Tjf Lea Leu Pro Arg Arg Glj Pro Arg Leo Glf Val Arg Ala 
J5 40 45 

ACA AGG AAG ACT TCG AAC CGG TCC CAG CCA CGT GGG GGG CGC CCG CCC 
Tkr Arg Lys Tbr Ser lys Arg Ser Glo Pro Arg Gly Gly Arg Arg Pro 
SO 5S 60 

ATC CCT AAA GAT CGG CGC TCC ACT GGC AAG TCC TGG GGG AAA CCA GGA 
He Pio Lys Asp Arg Arg Ser Tbr Gly Lys Ser Trp Gly Lyj Pro Gly 
65 70 75 80 

TAC CCC TGG CCC CTA TAT GGG AAT GAG GGA CTC GGC TGG GCA GGG TCG 
Tyr Pro Trp Pro Leo Tyr Gly Aso Glu Gly Leo Gly Trp Ala Gly Trp 

gS 90 95 

CTT CTG TCC CCC CGA GGT TCC CGT CCC TCT TGG GGC CCC ACT CAC CCC 
Leo Leo Ser Pro Arg Gly Ser Arg Pro Ser Trp <;iy Pro Tbr Asp Pro 

100 105 110 

CGG CAT AGG TCG CGC AAT CTG GGT AAG GTC ATC 
Afg His Arg Ser Arg Asb Val Gly Lys Val lie 
115 120 
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SEQ ID NO: 15 

SEQUENCE LENGTB:932 base pairs 
SEQUENCE TYPElaacleic acid 
STRANDEDKESS: double 
TOPOLOGY: linear 

MOLECULE TTPE:cDNA to Eeoomic RNA 



CG CGC AAC TTG GOT AAG GTC ATC GAT ACC CTC ACA TGC CGC TTC GCC 
Arc AsD Leu Gly Ljs Val He Asp Tkr Leu Thr Cn GI7 f^i Ala 

15 10 15 

GAC CTC ATC GGG TAC ATT CCG CTT GTC GGC GCC CCC CTA GGG GGT GCT 

Asp Leu Met Glj Tjr lie Pro Leo Val Gly Ala Pro Leo Glj Glj Ala 

20 25 30 

GCC AGG GCC CTG GCA CAT GGT GTC CGG GTT CTC GAG GAC GGC CTC AAC 

Ala Arg Ala Leu Ala His Clj Val Arg Val Leu Glu Asp Glj Val Asn 

35 40 45 

TAT GCA ACA GGG AAT TTC CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG 

Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leo 

SO 55 60 

GCT TTG CTG TCC TCT TTG ACC ATC CCA GCT TCC GCT TAT GAG GTC CGC 

Ala Leu Leu Ser Crs Leu Thr He Pro Ala Ser Ala Tjr Glu Val Arg 

65 70 75 



AAC GTA TCC GGG ATA TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGT 
Asn Val Ser Clj He Tyr His Val Tbr Asa Asp Cjs Ser Asa Ser Ser 

85 90 95 



ATT GTG TAT GAG GCA GCG GAC ATG ATC ATG CAT ACC CCC GGG TGC GTG 
He Val 111 Gil Ala Ala Asp Met He Met His Tkr Pro Gly Cys Val 

100 105 110 

CCC TGC CTT CGG GAG AAC AAC TCC TCC CCT TGC TCG GCA GCC CTC ACT 
Pro Cys Val Arg G1=v Asa Asa Ser Ser Arg Cys Trp Ala Ala Leu Tbr 

115 120 125 



CCC ACG TTA GCG CCC ACC AAC ACC AGC GTC CCC ACT ACG ACA ATA CCA 
Pro Tkr Leo Ala Ala Arg Asa Tbr Ser Val Pro Tkr Thr Thr He Arg 
130 135 149 
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CCG CAT GTC GAT TTG CTC GTT GGC GCG GCT GCT TTC TGC TCC GCT ATG 

Arg His Val Asp Leo Leo Val Glj Ala Ala Ala Phe Cys Ser Ala Uet 

MS ISO ISS 

TAC GTG GGG GAT CTC TGT CGA TCT GTC TTC CTC GTT TCC CAG CTG TTC 

Tyr Val Gly Asp Lto Cys Gly Ser Val Phe Leo Val Sei GId Lea Ph 
ISO ISS 170 17S 

ACT TTC TCA CCT CGT CG6 CAT GAG ACA GTA CAG GAC TGC AAC TGC TCA 

Till Phe Ser Pro Arc Arg Bis Gin Thr Val GU Asp Cys Asn Cys Ser 

180 ISS 190 

ATC TAT CCC GGC CAC TTG ACA GGT CAT CGC ATG GCT TGG GAT ATG ATG 

He Tyr Pro Gly His Leo Tkr Gly His Ar; Uet Ala Trp Asp Uet Uet 

19S 200 20S 

ATC AAC TGG TCA CCT ACA ACA GCC CTA CTG CTG TCG CAT CTA CTC CGG 

Met Asn Trp Ser Pro Thr Thr Ala Leu Vai Val Ser Bis Leu Leo Ar( 
210 215 220 

ATC CCA CAA GCT CTC ATG GAC ATG GTG GCG GCC GCT CAC TGG GGA GTC 

lie Pro GId Ala Val Uet Asp Uet Val Ala Gly Ala His Trp Gly Val 

225 230 235 

CTA GCG GCC CTC GCC TAC TAT TCC ATG GTG GGG AAC TGG GCT AA6 GTT 

Leu Ala Gly Leo Ala Tyr Tyr Ser Uet Val Gly Asn Trp Ala Lys Val 
240 24$ 250 2$S 

TTC ATT GTG ATG CTA CTC TTC GCC CGC GTT GAC GGG ACC ACC TAT GTG 

Leo He Val Uet Leo Leo Pie Ala Gly Val Asp Gly Thr Thr Tyr Val 

260 265 270 

ACA CGG GGG ACC ACA GGC CGC ACC ACC ACC TCG TTC CCA TCC CTC TTT 

Thr Gly Gly Thr Thr Gly Ar( Thr Thr Ser Ser Phe Ala Ser Lei Phe 

275 2S0 2t5 

ACA CTT GGG TCG XAT CAG AAG GTC CAG CTT ATA AAT ACC AAT GGC AGC 

Thr lea Gly Ser His Gla lys Vai Glo Leo He Aso Thr Asa Gly Ser 
290 295 300 

TGG CAC ATC AAC AGC ACC GCC 

Trp Bis lie Asn Arg Thr Ala 

305 310 
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SEQ ID NO: 16 

SEQUENCE LENGTHiSSS base pairs 
SEQUENCE TYPEioDcIeic acid 
STEANDEDNESSldoulle 
TOPOLOGY: liueir 

UOLECULE TYPElcDHA to Etooniic RNA 

CGC CGG TAT GAG ACG CCC CAA GAG TGC AAT TGC TCA CTC TAT CCC GCT 
Ari Arg Tjr Gh Thi Ala GId Asp Cjs Asn Cys Ser Leu Tyr Pro Gly 
15 10 15 

CAC GTA TCT GCT CAC CGC ATG GCT TGG CAT ATG ATG ATG AAC TGG TCA 
His Vai Ser Gly Bis Arg Uet Ala Trp Asp Met Met Met Asb Trp Ser 

20 25 30 

CCT ACA ACG GCC CTA GTG GTA TCG CAG CTA CTC CGG ATC CCA CAA GCC 
Pro Thr Tkr Ala Leo Val Val Ser Gin Leo Leo Arg He Pro Glo Ala 
35 40 45 

GTC GTG GAC ATG GTG GCG GGC GCC CAC TCG GGA GTC CTA GCG GGC CTT 
Val Val Asp Met Val Ala Gly Ala His Trp Gly Val Lea Ala Gly Leo 
50 55 60 

CCC TAC TAT TCC ATC GTG GCG AAC TGG GCT AAC GTC TTG GTT GTC ATG 
Ala Tyr Tyr Ser Mel Val Ala Asd Trp Ala.Lys Val Leo Val Vil Met 
65 TO 75 SO 

CTA CTC TTT GCC GGC GTT GAC GAC GGG AAG ACC ACC GTG ACG CGG GGG 
Leo Leo Pke Ala Gly Val Asp Asp Gly Lys Tbr Tkr Val Thr Gly Gly 

85 90 95 

AGC GCA GCC TTC CAC TCC ACG AAG TTA GTG TCC TTC TTC TCA CCA GGG 
Ser Ala Ala Pie Gin Ser Arg lys Leo Val Ser Pke Pbe Ser Pro Gly 

100 105 110 

CCG AAA CAA AAT ATC CAG CTT GAT AAC ACC AAC GGC AGC TGG CAC ATC 
Pre Lys Glo Asa I4< Glo Leo Asp Asn Thi Aso Gly Ser Trp His He 

115 120 125 

AAC AGC ACT GCC CTG AAT TGC AAT GAC TCC CTC CAA ACT CCG TTC ATC 
Aso Arg Tkr Ala Leo Asn Cys Aso Asp Ser Leo Gin Thr Gly Phe He 
130 135 140 
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GCT GCG CTG TTC TAC GCG CAC AAG TTC AAT TCG TCC GGA TGC OTA GAG 

Ala Ala Leu Phe Tyr Ala His Lys Phe Asn Ser Ser Cly Cys Leu Clu 

H5 1 50 1 55 160 

CGC ATG GCC AGC TGC CGC CCC ATT GAC AAG TTC GCG CAC CGG TGG GGT 

Arg Met Ala Ser Cys Arg Pro He Asp Lys Phe Ala Gin Gly Trp Gly 

16S no 175 

CCC ATC ACT CAC GAT ACG CCT AAG ATC CCG G 

Pro He Thr His Asp Tbr Fro Lys He Fro 

180 185 



SEQ ID KO:n 

SEQUENCE LE1(GTH:Z7( base pairs 
SEQUENCE TYPElnocleic acid 
STRANDEDNESS'.dooble 
TOPOLOGY: I i tear 

MOLECULE TYPElcDNA to genoBic RHA 

GA CAC CGT ATG GCA TCG GAC ATC ATG ATG AAC TGC TCG CCC ACG GCT 
Bis Arg Met Ala Trp Asp Met Met Met Asa Trp Ser ProTbr Ala 
I 5 10 15 

ACC ATG ATT CTG GCG TAT CTG ATG CCC ATC CCC GAG GTC GTC ATG CAC 
Tkr Met He Leu Ala Tyr Val Met Arg He Fro Ch Val Yal Met Asp 

20 2$ 30 

ATC ATT GGC CGG CCT CAC TGG CGC GTC ATG TTC CGC TTC GCG TAT TTT 
He He Glj Gly Ala His Trp Gly Val Met Phe Gly Leo Gly Tyr Phe 

35 40 45 

TCT ATC CAC CGG GCT TGC CCA AAA GTC GTT GTC ATC CTT CTG CTG GCC 
Ser Met Gli Gly Ala Trp Ala Lys Val Val Val He Lea Leo Leo Ala 
50 55 60 

GCT GCG CTG GAT GCG ACT ACC CTC ACC GTT CGG GGC TCT GCC CCG CAC 
Ala Gly Val Asp Ala Thr Thr Len Ser Val Gly Gly Ser Ala Ala His 
65 70 75 

ACC ACC GGC GGC CTT GTC GGC TTG TTC AAG CCT GGC G 
Thr Tkr Gly Gly Leu Val Gly Leu Phe Lys Pro Gly 
80 8S 90 
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SEQ ID NO: 18 

SEQUENCE LENGTH: 742 base pairs 
SEQUENCE T¥PE:nacleic acid 
STRANDEDNESS:double 
TOPOLOGY: iloear 

MOLECULE TyPE:cDNA to genooiic RNA 

CG CTT GTC GGC GCC CCC CTA GGG GGT GOT GCC AGG GCC CTG GCA CAT 
Leu Val Giy Ala Pro Leo Gly Gly Ala Ala Arg Ala Leo Ala His 
15 10 1!> 

GGT GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT CCA ACA GGC AAT TTG 
Gly Val Arg Val Leo Gh Asp Gly Val Asd Tyr Ala Thr Gly Asn Leo 

20 25 30 

CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG CTG TCC TGT TTG 
Pro Gly Cys Ser Phe Ser He Pie Leu Leu Ala Leu Leu Ser Cys Leu 

35 40 45 

ACC ATC CCA GCT TCC GCT TAT GAG GTG CGC AAC GTA TCC GGC ATA TAC 
Thr lie Pro Ala Ser Ala Tyr Glu Val Arg Asd Val Ser Gly He Tyr 
50 55 60 

CAT GTC ACG AAC GAC TGC TCC AAC TCA AGT ATT GTC TAT GAG GCA GCG 
His Val Thr Asn Asp Cys Ser Asn Ser Ser lie Val Tyr Glu Ala Ala 
65 70 75 

GAC ATG ATC ATG CAT ACC CCC GGG TGC GTG CCC TGC GTT CGG GAG AAC 
Asp Mel lie Met His Thr Pro Gly CysVal Pro Cys Val Arg Gin Asn 
80 85 90 95 

AAC TCC TCC CGT TGC TGG GCA GCG CTC ACT CCC ACG TTA GCG GCC ACG 
Aso Ser Ser Arg Cys Trp Ala Ala Leu Thr Pro Thr Lea Ala Ala Arg 

100 105 HO 

AAC ACC AGC GTC CCC ACT' ACG ACA ATA CGA CGG CAT GTC GAT TTG CTC 
Asn Thr Ser Val Pro^Thr Thr Thr He Arg Arg His Val Asp Leo Leu 

lis 120 125 

GTT GCG GCG GCT GCT TTC TCC TCC GCT ATG TAC GTG GGG GAT CTC TGT 
Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val Giy Asp Leu Cys 
130 135 140 
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GGA TCT GTC TTC CTC GTT TCC CAG CTG TTC ACT TTC TCA CCT CGT CCG 

Glj Ser Val Phe Leu Val Ser Glo Leu Phe Thr Phe Ser Pro kig ktg 

145 150 155 

CAT GAG ACA CTA CAG GAC TGC AAC TGC TCA ATC TAT CCC GGC CAC TTG 

Bis Glo Thr Val Gin Asp Cys Asd C|fs Ser lie Trr Pro Glj Bis Leu 
1 60 1 65 1 70 175 

ACA GGT CAT CGC ATG GCT TGG GAT ATG ATG ATG AAC TGG TCA CCT ACA 

Thr Glf Bis Ar; Met Ala Trp Asp Met Uet Met Aso Trp Ser Pro Thr 

180 185 190 

ACA GCC CTA GTC GTG TCG CAT CTA CTC CGG ATC CCA CAA GCT GTC ATG 

Thr Ala Leo Val Val Ser His Leo Leu Ar{ He Pro Gin Ala Val Met 

195 200 205 

GAC ATC GTC GCC GGG GCC CAC TGG CCA GTC CTA GCC GGC CTT GCC TAC 

Asp Met Val Ala Glr Ala His Trp Gly Val Leo Ala Gly Leo Ala Tyr 

210 215 220 

TAT TCC ATG GTG GGG AAC TGG GCT AAG GTT TTG ATT CTG ATG CTA CTC 

Tyr Ser Met Val Gly Asa Trp Ala Lys Val Leo He Val Met Leo Leu 

225 230 235 

TTC GCC GGC GTT GAC GGG ACC AC 

Phe Ala Gly Val Asp Gly Tkr 
240 245 



SEQ ID no: 19 

SEQUENCE LENGTe:20 base pairs 
SEQUENCE TTPEiiBcleic itH 
STXANDEDKESSlsiifle 
TOPOLOGT: linear 

C6ATACACCG GTGACTTTGA 



SEQ ID N0:20 

SEQUENCE LENGTHJO base pairs 
SEQUENCE TYPEinocleic acid 
STRANDEDNESSlsiBtle 
TOPOLOGY: I ioear 
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10 



15 



25 



30 



TGCATGCACG TGGCGATGTA 20 
Sequeoce Li st i q; 



SEQ ID N0:21 

SEQUENCE LENGTH:20 base pairs 
SEQUENCE TYPEiDucleic acid 
STRANDEDNESSlsiogle 
topology: linear 

GATGCCCACT TCCTCTCCCA 20 



SEQ ID NO:22 
20 SEQUENCE LENGTa:20 base pairs 
SEQUENCE TYPElDQcleic acid 
STRANDEDNESS:siDgle 
TOPOLOGY: linear 



CTCAGGGTAA CCTCGTTGGT 20 



Claims 



(1) A DMA fragment comprising a base sequence which encodes a non-A nor>-B hepatitis-specific antigen 
polypeptide from a non-structurat protein of the non-A non-B hepatitis virus, said polypeptide consisting of the 

35 whole or a part of the amino add sequence represented by the SEQ ID NO. 1 . 

(2) A DMA fragment comprising a base sequence which encodes a non-A non-B hepatitis-spedfic antigen 
polypeptide from a non-structural protein of the non-A non-B hepatitis virus, said polypeptide consisting of the 
whole or a part of the amino add sequence represented by the SEQ ID NO. 2. 

(3) A DNA fragment comprising a base sequence which encodes a non-A non-B hepatids-spedfic antigen 
40 pdypeptide from a non-structural protein of the non-A non-B hepatitis viros, said polypeptide consisting of the 

whole or a part of the amino add sequence represented by the SEQ ID NO. 3. 

(4) A DNA fragment comprising a base sequence which encodes a nonA non-B hepatitis-spedfic antigen 
pdypeptide from a non-structural protein of the non^ non-B hepatitis virus, said polypeptide consisting of the 
whole or a part of the amino add sequence represented by the SEQ ID NO. 4. 

45 <5) A DNA fragnwnt comprising a base sequence which encodes a non-A non-B hepatitis-spedfic antigen 
pdypeptide from a non-structural protein of the non-A non-B hepatitis vinjs, said polypeptide consisting of the 
whole or a part of the amino add sequence represented by the SEQ ID NO. 5. 

(6) A DNA fragment comprising a base sequence which encodes a nonA non-8 hepatitis-spedfic antigen 
pdypeptide from a nonHstructurai protein of the non-A non-B hepatitis virus, said polypeptide consisting of the 

50 whole or a part of the amino add sequence represented by the SEQ ID NO. 6. 

(7) A DNA fragment comprising a base sequence which encodes a non-A non-B hepatitis-spedfic antigen 
pdypepetide from a non-stiucturai protein of thea non-A non-B hepatitis vbus, said pdypeptide consisting of 
the whole or a part of the amino add sequence represented by the SEQ ID NO. 7. 

(8) A DNA fragment comprising a base sequence which encodes a non-A non-B hepatitis-spedfic antigen 
55 pdypeptide from a non-structural protein of the non-A non-B hepatitis virus, said pdypeptide consisting of the 

whole or a part of ttie amino add sequence represented by ttie SEQ ID NO. 8 

(9) A DNA fragment comprising a base sequence which encodes a non-A non-B hepatitis-spedfic antigen 
pdypeptide from a non-structural protein of ttie non-A non-B hepatitis virus, said polypeptide consisting of tiie 
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whole or a part of the amino acid sequence represented by the SEQ ID NO. 9. 

(10) A DNA fragment comprising a base sequence which encodes a non-A non-B hepatitis-specific antigen 
polypeptide from a non-structural protein of the non-A non-B hepatitis virus, said polypeptide consisting of the 
whole or a part of the amino acid sequence represented by the SEQ 10 NO. 10. 
5 (11 ) A DNA fragment comprising a base sequence which encodes a non-A non-B hepatitis-specific antigen 

polypeptide from a non-structural protein of the non-A non-B hepatitis virus, said polypeptide consisting of the 
whole or a part of the amino acid sequence represented by the SEQ ID NO. 11. 

(12) A DNA fragment comprising a base sequence which encodes a non-A non-B hepatitis-specific antigen 
polypeptide from a non-structural protein of the non-A non-B hepatitis virus, said polypeptide consisting of the 

10 whole or a part of the amino acid sequence represented by the SEQ ID NO. 12. 

(13) A DNA fragment comprising a base sequence which encodes a non-A non-B hepatitis-specific antigen 
polypeptide from a non-structural protein of the non-A non-B hepatitis virus, said polypeptide consisting of the 
whole or a part of the amino add sequence represented by the SEQ ID NO. 13. 

(14) A DNA fragment comprising a base sequence which encodes a non-A non-B hepatitis-specific antigen 
15 polypeptide from a structural protein of the non-A non-B hepatitis virus, said polypeptide consisting of the w^ole 

or a part of the amino add sequence represented by the SEQ ID NO. 14. 

(1 5) A DNA fragment comprising a base sequence which encodes a non-A non-B hepatitis-specific antigen 
polypeptide from a structural protein of the non-A non-B hepatitis virus, said polypeptide consisting of the whole 
or a part of the amino add sequence represented by the SEQ ID NO. 15. 

20 (1 6) A DNA fragment comprising a base sequence which encodes a non-A non-B hepatitis-specific antigen 
polypeptide from a structural protein of the non-A non-B hepatitis virus, said polypeptide consisting of the whole 
or a part of the amino add sequence represented by the SEQ ID NO. 16. 

(17) A DNA fragment comprising a base sequence which encodes a non-A non-B hepatitis-specific antigen 
polypeptide from a structural protein of the non-A ndn-B hepatitis virus, said pdypeptide consisting of the whole 

25 or a part of the amino add sequence represented by the SEQ ID NO. 1 7. 

(1 8) A DNA fragment comprising a base sequence which encodes a non-A non-B hepatitis-specific antigen 
pdypeptide from a structural protein of the non-A non-B hepatitis virus, said polypeptide consisting of the whole 
or a part of the amino add sequence represented by the SEQ ID NO. 18. 

(19) An expression vector containing the DNA firagnr^ent according to any one of daims 1 to 18 in a doning 
30 site downstream of a promoter gene in the vector. 

(20) An expression vector according to daim 19, wherein the vector is a plasmid. 

(21) An expression vector according to daim 20, wherein the vector is plasmid Trp-TrpE-C11-C21. 

(22) An expression vector according to daim 20, wherein the vector is plasmid Trp-TrpBC11-7. 

(23) A transfbmnant comprising a host ceU transfonmed with the expression vector according to any one of 
35 daims 19 to 22 

(24) A transfonmant according to daim 23, wherein the host cell is Escherichia coii. 

(25) A prooess for produdng a recombinant non-A non-6 hepatitis-spedfic antigen polypeptide obtained 
by expression of the DNA fragment according to any one of daims 1 to 18, which comprises thefdiowing steps 
of: 

40 constructing a repllcable expression vector which Is capable of expressing said DNA fragment In an 

appropriate host cell; 

obtaining a transfonrant by Introdudng said expression vector Into the host ceil; 
produdng said recombinant pdypeptide by culturing said transfonmant under such conditions that said 
DNA fragment is expressed; and 
45 recovering the recombinant polypeptide. 

(26) A reoombinarit non-A rKm-B hepatitis-spedfic antigen polypeptide obtained by expressing the DNA 
fragment according to any one of daims 1 to 18. 

(27) A method for ampHfying a non-A non-B hepatitis vfms gene using sense and/or antisense sequence 
synthesized on the basis of a partial base sequence of the DNA fragment according to any one of daims 1 to 

60 18. 

(28) A single strand DNA seqirance for PGR primer represented by SEQ ID NO. 19. 

(29) A single strand DNA sequence for PGR primer represented by SEQ ID NO. 20. 

(30) A single strand DNA sequence for PGR primer represented by SEQ ID NO. 21 . 

(31) A single strand DNA sequence for PGR primer represented by SEQ ID NO. 22. 

55 (32) A method for detecting a non-A non-B hepatitis vims gene in a fluid sample to be examined, which 
comprises the fdlowing steps of: 

isdating RNA from a fluid sample to be examined; 

synthesizing cDNA by treating the RNA with a reverse transcript^e; 
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subjecting ttie cDNA to polymerase chain reaction using at least one single strand DMA sequence 
according to any one of daims 28 to 31; and 

detecting an amplified non-A non-B hepatitis virus gene. 

(33) An invnunological method for detecting an antifc>ody directed against a non-A non-B hepatitis virus anti- 
5 gen, which comprises the following steps of: 

incut)ating a fluid sample which may contain an anti-non-A non-B hepatitis virus antibody, in the presence 
of at least one recombinant non-A non-& hepatitis-specific-antigen polypeptide according to daim 26 under 
such conditions that said antibody and said polypeptide are capable of undergoing antigen-antibody reaction; 
and 

10 detecting an antigen-antibody complex. 

(34) Use of the recombinant non-A non-6 hepatitis-specific antigen polypeptide according to datm 26 in 
the detection of the non-A non-B hepatitis virus. 

(35) Use of the single strand DNA sequence for PGR primer according to any one of daims 28 to 31 in the 
detection of the non-A non-B hepatitis virus. 
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Fi g. 2 



10 20 30 40 50 60 

CACGCCCGGTTTGCCCGTGTGTCAAGACCACCTGGAGTTCTGGGAAGCGGTCTTCACAGG 
ThrProGlyLeuProValCysGlnAspHi sLeuGluPheTrpGluAlaVa IPheTbrGl y 

70 80 90 100 110 120 

TCTCACGCACATTGATGCCCACTTCCTCTCCCAGACAAAGCAAGGAGGAGACAACTTCGC 

LeuThrHi sIleAspAlaHi sPheLeuSe rG 1 nThrLy sG 1 nG 1 yG 1 yAspAsnPhe A 1 a 

130 140 150 160 170 ISO 

GTATCTAACGGCCTACCAGGCCACAGTGTGCGCTAGGGGAAAGGCCCCTCCTCCCTCGTG 
Ty rLeuThrAUTy rGlnAlaThrValCysAlaArgAlaLysAlaProProProSerTrp 

190 200 210 220 230 240 

GGATGTGATGTGGAAATGTCTAGCTAGGCTGAAGCCTACACTAATTGGTCCTACCCCCCT 
AspValMe tTrpLy sCy sLeuA 1 aA r gLeuLy sPr oTh rLeu 1 1 eG 1 yProTh rProLeu 

250 260 270 280 290 300 

CCTGTACCGCTTGGGTGCCGTGACCAACGAGGTTACCCTGACGCACCCCGTGACGAAATA 
LeuTyrArgLeuGlyAlaValThrAsnGluValThrLeuThrHi sProValThrLysTyr 

310 320 330 340 350 360 

CATCGCCACGTGCATGCAAGCTGACCTCGAGATCATGACGAGCACATGGGTCCTAGCAGG 

IleAlaThrCysMe tGlnAlaAspLeuGluI leMe tThrSerThrTrpValLeuAlaGly 

370 380 390 400 410 420 

GGGGGTGCTAGCCGCCGTGGCAGCTTACTGCCTGGCAACCGGCTGTGTTTCCATCATCGG 

GlyValLeuAlaAlaValAlaAlaTyrCysLeuAlaThrGlyCysValSerllelleGly 

430 440 450 460 470 480 

CCGCCTACACCTGAATGATCAAGTGGTTGTGACTCCTGACAAAGAAATCTTATATGAGGC 
ArgLeuHisLcuAsnAspGlnValValValThrProAspLysGluI leLeuTy rGluAla 

490 500 510 520 530 5A0 

CTTTGATGAGATGGAAGAATGCGCCTCCAAAGCCGCCCTCATTGAGGAAGGGCAGCGGAT 

PheAspGluMe tGl^luCysAlaSerLysAlaAlaLcuI leGluGluGlyGlnArgMe t 

550 560 570 580 590 600 

GGCGGAGATGCTCAAGTCTAAGATACAAGGCCTCCTACAACAGGCCACAAGACAGGCCCA 

AlaG JuMe tLcuLysSerLys I 1 eG 1 nG 1 yLeuLeuG J nG 1 nA 1 aTh r Ar gG 1 nA 1 aG 1 n 



610 

AGACATACAGCCAGC 
Asp I I eG 1 nPro 
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Fig. 4 



10 20 30 40 50 60 

CTGGTATGAACTTACGCCTGCTGAGACTACGGTGAGACTCCGGGCCTATTTCAACACGCC 
TrpTy rGluLeuThrProAlaGluThrThrValArgLcuArgAl aTyr PheAsnThrPro 



70 80 90 100 110 120 

CGGCCTGCCTGTGTGTCAAGACCACCTGGAATTCTGGGAGGCGGTCTTCACAGGTCTCAC 
GlyLeuProValCysGlnAspHisLeuGluPheTrpGluAl aValPheThrGlyLeuThr 



130 140 150 160 170 180 

ACACATCGATGCCCACTTCCTCTCCCACACGAAGCAAGGAGGAGATAACTTTGCATATTT 
His IleAspAlaHi sPheLeuSerGlnThrLysGlnGlyGlyAspAsnPheAlaTy rLeu 



190 200 210 220 . 230 240 

AACAGCCTACCAGGCCACAGTCTGCGCTAGGGCAAAGGCTCCCCCTCCTTCGTGGGACGT 
ThrAlaTyrGlnAlaThrValCysAlaArgAlaLysAlaProProProSerTrpAspVal 



250 260 270 280 290 300 

GATGTGGAAGTGTTTGATTAGGCTCAAACCTACACTGACTGGTCCTACCCCCCTCCTGTA 
Me tTrpLysCysLcuIleArgLeuLysProThrLeuThrGlyProThrProLeuLcuTyr 



310 320 330 340 350 360 

CCGCTTGGGTGCCGTGACCAACGAGGTTACCCTGACTCACCCCATGACGAAATATATCGC 

ArgLeuGlyAlaValThrAsnGluValThrLcuThrHlsProMc tThrLysTyrllcAl 



370 380 390 400 410 420 

CACTTGTATGCAAGCTGATCTTGAGATCATGACAAGCACATGGGTCTTGGCGGGGGGGGT 

ThrCysMeiGlnAlaAspLeuGluIleMe tThrScrThrTrpValLeuAlaGlyGlyVa 



430 440 450 460 470 480 

GCTAGCCGCTGTGGCAGCTTACTGCCTAGCGACCGGCTCCATTTCCATCATTGGCCGCCT 

LeuAlaAlaValAlaAlaTyrCysLeuAlaThrGlyCysIleSerrielleGlyArgLeu 



490 500 510 520 530 540 

TCACCTGAATGATCGGGTGGTCGTGACCCCTGATAAGGAAATTTTATATGAGGCCTTTGA 
HI sLeuAsnAspArgValValValThrProAspLysGluIleLeuTyrGluAlaPhcAsp 



550 560 570 580 590 " 600 

TGAGATGGAAGAGTGCGCCTCCAAAGCCGCCCTCATTGAGGAAGCGCAGCGGATGGCGGA 

GluMe tGluGluCysAlaSerLysAlaAlaLeu I 1 cG luG I uG 1 yG 1 nArgMe tAiaGlu 



610 620 630 

GATGCTGAAGTCTAAAATACAAGGCCTCTT 

Me tLeuLy sSe rLy s I leGlnGlyLeu 
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Fig. 5c 



790 800 810 820 830 840 

GCTGAGACCACAGTGAGACTCCGGGCTTATTTCAACACGCCCGGTTTGCCCGTGTGTCAA 
AlaGluThrThrValArgLeuArgAlaTyrPheAsnThrProGlyLeuProValCysGIn 



. 850 860 870 880 890 900 

GACCACCTGGAGTTCTGGGAAGCGGTCTTCACAGGTCTCACGCACATTGATGCCCACTTC 
AspHi sLeuG luPheTrpGluAlaValPheThrGlyLeuThrHi sIlcAspAlaHi sPhe 



910 920 930 940 950 960 

CTCTCCCAGACAAAGCAAGGAGGAGACAACTTCGCGTATCTAACGGCCTACCAGGCCACA 

LeuSerGlnThrLysGlnGlyGlyAspAsnPhcAlaTyrLeuThrAlaTyrGlnAlaThr 



970 980 990 1000 1010 1020 

GTGTGCGCTAGGGCAAAGGCCCCTCCTCCCTCGTGGGATGTGATGTGGAAATGTCTAGCT 
ValCysAlaArgAlaLysAlaProProProSerTrpAspValMetTrpLysCysLeuAla 



1030 1040 1050 1060 1070 1080 

AGGCTGAAGCCTAGACTAATTGGTCCTACCCCCCTCCTGTACCGCTTGGGTGCCGTGACC 
ArgLeuLysProThrLeulleGlyProThrProLeiiLcuTyrArgLeuGlyAlaValThr 



1090 1100 1110 1120 1130 1140 

AACGAGGTTACCCTGACGCACCCCGTGACGAAATACATCGCCACGTGCATGCAAGTGAAC 
AsnGluValThrLeuThrHisProValThrLysTyrllcAUThrCysMe tGlnValAsn 



1150 1160 1170 1180 1190 1200 

CTCGAGATCATGACGAGCACATGGGTCCTAGCAGGGGGGGTGCTAGCCGCCGTGGCAGCT 
LeuGluI leMe tThr Se rThrTrpValLeuAlaGlyGlyValLeuAl aAl aVa lAl aAl a 



1210 1220 1230 1240 1250 1260 

TACTGCCTGGCAACCGGCTGTGTTTCCATCATCGGCCGCCTACACCTGAATGATCAAGTG 
TyrCysLcuAlaThrGlyCysValScrl le IleGlyArgLeuHlsLcuAsnAspGlnVa 1 



1270 1280 1290 1300 1310 1320 

GTTGTGACTCCTGACAAAGAAATCTTATATGAGGCCTTTGATGAGATGGAAGAATCCGCC 
ValValThrProAsrLysGluI IcLeuTyrGluAlaPhcAspGluMe tG luG luCy sA 1 a 



1330 1340 1350 1360 J 370 1380 

TCCAAACCCGCCCTCATTGAGGAAGGGCAGCGGATGGCGGAGATGCTCAAGTCTAAGATA 

Se rLysA laAl aLeu M eG 1 uC 1 uG 1 yG 1 nA r gMe tAl aGluMe tLeuLysSe rLy s I I e 



1390 1400 14.10 1420 

CAAGCCCTCCTACAACAGGCCACAAGACAGGCCCAACACATACACC 
G InG lyLeuLeuGInG I nAlaThrArgG I nAlaG InAsp MeCln 
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Fig. 6b 



370 380 390 400 410 420 

GAGCGCGGCTCGTCGTGCTCGCTACCGCTACGCCTCCGGGATCGGTCACCGTGCCACATC 
AlaArgLeuValValLeuAlaThrAlaThrProProGlySerValThrValProHlsPro 

430 440 450 460 470 480 

CCAATATCGAGCAGGTGGCCCTGTCCACCACTGGAGAGATTCCCTTCTACGGCAAAGCTA 
AsnlleGluGluValAlaLeuSerThrThrGlyGluIleProPheTyrGlyLysAlalle 

490 500 510 520 530 540 

TCCCCATCGAGACAATCAAGGGGGGGAGGCATCTCATCTTCTGCCGTTCCAAGAAGAAGT 
Pro I leGluThrlleLysGlyGlyArgHi sLeuI lePheCysArgSerLysLysLysCys 

550 560 570 580 590 600 

CTCACGAGCTCGCTGGAAAGCTGTCAGCCCTCGGAATCAACGCTGTAGCGTACTACCGGG 

AspG luLeuAl aGl yLysLeuSe rAlaLeuGly I IcAsnAlaVa lA I aTy rTy rArgGl y 

glO 620 830 640 650 660 

GTCTTGATGTATCCGTCATACCGACCAGCGGAGACGTCGTTGTCGTCCCAACAGACGCTC 
LeuAspValSerVal U eProTh rSe rG 1 yAspVa 1 Va IVa 1 Va 1 A 1 aTh r AspAl aLeu 

670 680 690 700 710 720 

TAATGACCGCCTACACCGGTGACTTTGATTCAGTGATCGACTGCAATACATGTGTCACCC 
Me tThrGlyTyrThrGlyAspPheAspSerVal I leAspCysAsnThrCysValThrGln 

730 740 750 760 770 780 

AGACAGTCGACTTCAGCTTGGACCCTACCTTCACCATTGAGACGACGACCGTGCCTCAAG 
ThrValAspPheSerLeuAspProThrPheThrlleGluThrThrThrValProGlnAsp 

790 800 810 820 830 840 

ACGCGGTGTCACGCTCCCAGCGGCCAGGCAGAACTGGTAGGGGTAGAGGGGGCATATACA 

AlaValSerAr«Se^lnAr«ArgGlyArgThrGlyArgGlyArgGlyGlyIleTyrArg 



850 

GGTTTGTGAGTCCAG 
PheValThrPro 
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Fig. 7 



10 20 30 40 50 60 

GACGAGCTCGCCGCAAAGCTGTCAGGCCTCGGAGTCAATGCTGTGGCATACTACCGGGGT 

AspGluLeuAlaAlaLysLeuSerGlyLeuGlyValAsnAlaValAlaTyrTyrArgGly 

70 80 90 100 110 120 

CTCGATGTGTCTGTCATACCGACGAGCGGGGACGTCGTTGTTGTGGCAACAGACGCTCTA 
LeuAspValSerVallleProThrSerGlyAspValValValValAlaThrAspAlaLeu 

130 140 150 160 170 180 

ATGACGGGCTATACCGGCGACTTTGACTCGGTGATCGACTGCAATACATGTGTCACCCAA 
MetThrGlyTyrThrGlyAspPheAspSerVallleAspCysAsnThrCysValThrGln 

190 200 210 220 230 240 

ACAGTCGATTTCAGCTTGGACCCTACTTTCACCATTGAGACGACGACCGTGCCCCAAGAC 

ThrValAspPheSerLeuAspProThrPheThrlleGluThrThrThrValProGlnAsp 

250 260 270 280 290 300 

GCGGTGTCGCGCTCGCAGCGGCGAGGCAGGACTGGTAGGGGCAGGGTGGGCATATACAGG 

AlaValSerAr«SerGlnArgArgGIyAr«ThrGlyArgGlyArgValGlyIleTyrArg 



310 

TTTGTGACTCCCGAG 
PbeValThrProClu 
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Fig. 8a 



10 20 30 40 SO SO 

GTGATGAGCTCGCCGCAAAGCTCTCAAGCCTCGGACTCAACGCTGTAGCATATTACCGGG 

AspGluLeuAlaAlaLysLeuSerSerLeuGlyLeuAsnAlaValAlaTyrTyrArgGly 

70 80 90 100 110 120 

GTCTTGATGTGTCCGTCATACCGACTAGTGGAGACGTCGTTGTCGTGGCAACAGACGCTC 
LeuAspValSerVallleProThrSerGlyAspValValValValAlaThrAspAlaLeu 

130 140 150 160 170 180 

TAATGACGGGCTATACCGGCGACTTTGACTCAGTGATCGACTGTAACACATGTGTCACCC 
MetThrGlyTyrThrGlyAspPheAspSerValllcAspCysAsnThrCysValThrCln 

190 200 210 220 230 240 

AGACAGTTGATTTCAGCTTGGATCCAACCTTCACCATTGAGACGACGACCGTGCCTCAAG 
ThrValAspPheSerLeuAspProThrPheThrlleGluThfThrThrValProGlnAsp 

250 260 270 280 290 300 

ACGCGGTGTCGCGCTCGCAGCGGCGAGGTAGGACTGGCAGGGGCAGGGGCGGCATCTATA 

AlaValSerArgSerGlnArgArgGlyArgThrGlyArgGlyArgGlyGlylleTyrArg 

310 320 330 340 350 ^^^^360 

GGTTTGTGACTCCAGGAGAACGGCCCTCGGGCATGTTCGATTCCTCGGTCCTGTGTGAGT 
PheValThrProGlyGluArgProSerGlyMetPheAspSerSerValLeuCysGluCys 

370 380 390 400 410 420 

GTTATGACGCGGGCTCTGCTTCGTATGAGCTCACGCCCGCCGAGACCACGGTTAGGTTGC 

TyrAspAlaGlyCysAUTrpTyrGluLeuThrProAUGluThrThrValArgLeuArg 

430 440 450 460 470 480 

GGGCTTACCTAAATAClrcCAGCGTTGCCCGTCTGCCAGGACCATCTGGAGTTCTGGGAGG 

AlaTyrLeuAsnThrProGlyLeuProValCysGlnAspHlsLeuGluPheTrpGluGly 

490 500 510 520 530 540 

GCGTCTTCACAGGCCTCACCCACATAGATGCCCATTTCTTGTCTCAGACTAAGCACGCAG 

ValPheThrGlyLeuThrHisIleAspAlaHisPheLeuSerGlnThrLysGlnAlaOiy 
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F i g.db 



550 560 570 580 590 600 

GAGACAACTTTCCCTACCTGGTGGCATACCAAGCTACAGTGTGCCCCAGGCCTCAGGCTC 
AspAsnPheProTyrLeuValAlaTyrGlnAUThrValCysAlaArgAlaGlnAlaPro 

GiO 620 630 640 650 660 

CACCTCCATCGTGGCACCAAATGTGGAAGTGTCTCATACGGCTGAAACCTACGCTGCACG 

ProProSerTrpAspGlnMetTrpLysCysLeuIleArgLeuLysProThrLeuHisGly 

S70 680 690 700 710 720 

GCCCAACACCCCTGCTGTATAGGCTAGGAGCCGTCCAAAATCAGGTCACCCTCACACACC 

ProThrProLeuLeuTyrArgLeuGlyAlaValGlnAsnGluValThrLeuThrHisPro 

730 740 750 760 770 780 

CCATAACCAAATTCATCATGGCATGCATGTCGGCTGATCTGGAGGTCGTCACCAGCACCT 
IleThrLysPhelleMetAlaCysMetSerAlaAspLeuGluValValThfSerThrTfP 

790 800 810 820 830 

GGGTCCTGGTGGGCGGAGTCCTTGCAGCTCTGGCCGCATATCGCCTGACAACAGGCAGCG 

ValLeuValGlyGlyValLeuAlaAlaLeuAlaAlaTyrArgLeuThrThrGlybervai 

850 860 870 880 890 900 

TGGTCATCGTGGGTAGGATCATCTTGTCTGGGAGGCCGGCTGTCATTCCCGACAGGGAAG 

VallleValGlyArgllelleLeuSerGlyArgProAUVallleProAspArgGluVal 



910 
TCCTTTACCGG 
LeuTyrArg- 
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Fig. 9 



10 20 30 40 50 60 

CGACAACCGTGCCCCAAGACGCGGTGTCGCGCTCACAACGGCGGGGTAGGACAGGTAGGG 

ThrThrValProGlnAspAlaValSerArgSerGlnArgArgGlyArgThrGlyArgGly 

70 80 90 100 110 120 

GCAGGAGAGGCATCTACAGATTTGTGACTCCGGGAGAACGGCCCTCGGGCATGTTCGATT 
ArgArgGlylleTyrArgPheValTbrProGlyGluArgProSerGlyMetPheAspSer 

130 140 150 160 170 180 

CTTCGGTCCTGTGTGAGTGCTATGACGCGGGCTGCGCTTGGATCGAGCTCACGCCCGCCG 

SerValLeuCysGluCysTyrAspAUGlyCysAlaTrpIleGluLeuThrProAlaGlu 

190 200 210 220 230 240 

AGACCTCAGTTAGGTTGCGGGCTTACCTAAATACACCAGGGTTGCCCGTCTGCCAGGACC 
ThrSerValArgLeuArgAlaTyrLeuAsnThrProGlyLeuProValCysGlnAspHis 

250 260 270 280 290 300 

ACCTGGAATTCTGGGAGAGCGTCTTCACAGGCpCACCCATATAGATGCCCACTTC^^^ 
LeuGluPheTrpCluSerValPheThrClyLcuThrHislleAspAlaHlsPheLeuSer 

310 320 330 340 350 360 

CCCAGACCAAGCAGGCAGGAGACAACTTCCCCTACCTGGTAGCATACCAAGCTACA^ 

GlnThrLysGInAUGlyAspAsnPheProTyrLeuValAUTyrGlnAUThrValCys 

370 380 390 400 410 420 

CCGCCAGGGCCCAGGCTCCACCACCATCGTGGGATCAAATGTGGAAGTGTCTCATACCCC 

AlaArgAlaGlnAUProProProSerTfpAspGlnMetTrpLysCysLeuneArgLeu 

430 440 450 460 4-70 480 

TGAAACCTACGCTACACGGGCCAACACCCCTGTTGTATAGGCTGGGAGGCCTCCAAAA^ 
LysProThrLeuHisGlyProThrPfoLeuLcuTyrArgLeuGlyAlaValGlnAsnGlu 



AGGTCACCC 
ValThr 
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Fig. 10 o 



1 10 20 30 40 50 60 

GTGGTCTCCTGGGTGCCATCGTGGTCAGCCTAACGGGCCGCGACAAGAACCAGGTCGAGG 
GlyLeuLeuGlyAlalleValValSerLeuThrGlyArgAspLysAsnGlnValGluG 



7rt 80 90 lO" 

GGGAGGTTCAGGTGGTCTCCACCGCAACGCAATCTTTCCTGGCGACCTGCGTC^ 

lyGluValGlnValValSerThrAlaThrGlnSerPheLeuAUThrCysValAsnGlyva 



70 80 90 100 110 

AGGTT 
yGl uVal 

,on 140 150 160 170 

gtgttgga2cgtctaccatggcgccggctcgaaaaccctggccggcccga^^ 

ICysTrpThrValTyrHisGlyAlaGlySerLysThrLeuAlaGlyProLysGlyProvai 

ACCCAi!?GTACACTrA?GTGGACcUGACCTCGTCGGCTGGCCGGC^^ 
ThrGlnMetTyrThrAsnValAspGlnAspLeuValGlyTrpProAlaProSerGlyAlaA 

«icn 9fin 270 280 290 

GGTCCT?GACACCATGCACCTGCGGCAGCTCGGACCTTTACTTGGT^^^^ 
rgSerLeuThrProCysThrCysGlySerSerAspLeuTyrLeuValThrArgHisAlaAS 

TGT?i?TCCGGTG?G?CGGCGGGGCGATAGCAGGjGGAGCCTGCT?TCCCCCAGGqC^^ 
ISI?neProvIlArgArgArgGlyAspSerArgGlySerL^^^^ 

tcctUgaaggg°ctcctcagg?Sgtccactg^^^^^^ 

SerTyrLeuLysGlySerSerGlyGlyProLeuLeuCysProSerGlyHisUevaiuiy 



430 440 450 460 



470 480 



TC??CCGGGCT;aaGTGTGCACCCGGGGGGTTGCGAAGGCGGTGGAC^^ 
lePheArgAlaAlaValCysThrArgGlyValAlaLysAlaValAspPheValProValui 

J??TATGGAAA2?ACTATGCGi?CTCCGGT™^ 

uSerMetGluThrThrMetArgSerProVaiPheThrAspAsnSerSerProFroAl 

CTA 580 S90 600 



740 750 760 



770 780 

CTCCACCTATGGCAAGTTTCTTG 



GGGGTAAaaACCATCACTACGGGCGCCCCCATTACATACTCCACO..^ 
G 1 y Ya 1 ArgThr I 1 cThrThrGl yAl aPro 1 1 cThrTyrSerTtir yru 
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Fig. lOb 



800 810 820 830 840 

CCGACGGTGGTTGCTCCGGGGGCGCCTATGACATCATAATATGTGATGAGTGCCACtCAAC 

laAspGlyGlyCysSerGlyGlyAlaTyrAspI 1 el 1 el 1 eCysAspGluCysHi sSerTh 

860 870 880 890 900 

TGACTCGACTTCCATTTTGGGCATTGGCACGGTCCTGGACCAAGCGGAGACGGCTGGAGCG 

rAspSerThrSerlleLeuGlyl leGlyThrValLeuAspGlnAlaGluThrAlaGlyAla 

920 930 . 940 950 960 970 

CGGCTCGTCGTGCTCGCCACCGCTACGCCTCCAGGATCGGTCACTGTGCCTCATCCCAACA 

ArsLeuYalValLeuAlaThrAlaThrProProGlySerValThrYalProHisProAsnl 

980 990 1000 1010 1020 1030 

TCGAGGAGGTGGCCTTGTCCAGCACTGGAGAGATTCCCTTCTATGGCAAAGCCATCCCCAT 
leGluGluValAlaLeuSerSerThrGlyGluIleProPheTyrGlyLysAlal leProI 1 

1040 1050 1060 1070 

TGAGACCATCAAGGGGGGAAGGCATCTCATTTTCTGCCAC 

eG 1 uThr 1 1 eLys G I yG 1 yArgH i s Leu 1 1 ePh eCys His 
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Fig. I I 



1 10 20 30 40 SO 60 

GTCGACCCCAATATTAGAACTGGGGTAAGGACCATCACCACGGGCGCTCCCATTACGTAT 
ValAspProAsnl leArgThrGlyValArgThrlleThrThrGlyAlaProI leThrTyr 

70 80 90 100 110 

TCTACCTATGGCAAATTCCTTGCCGACGGTGGTTGCTCTGGGGGCGCCTATGACATCATAA 

SerThrTyrGlyLysPheLeuAlaAspGlyGlyCysSerGlyGlyAlaTyrAspIlel lei 

130 140 150 160 170 

TCTGTGATGAGTGCCACTCAACTGACTCGACTTCCATCTTGGGTATCGGCACAGCCCTGGA 

1 eCys AspGl uCysHi sSerThrAspSerThrSer 1 1 eLeuGl yl 1 eGlyThrAl aLeuAs 

190 200 210 220 230 

CCAAGCGGAGACGGCTGGAGCACGGCTTGTCGTGCTCGCCACCGCTACGCCTCCAGGGTCG 

pGlnAlaGluThrAlaGlyAlaArsLeuValValLeuAlaThrAlaThrProProGlySer 

250 260 270 280 

GTCACCGTGCCGCATCCCAACATCGAGGAGGTAGCCTTGCC 
ValThrValProHisProAsnl leGluGluYalAlaLeu 
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Fi g . 12 



20 30 40 

GGACAACTCATCTCCCCCGGCGGTACCGCAGACATTCCAGGTGGCCCATCTACACGCTCC 

70 80 90 100 110 

CACTGGCAGCGGCAAGAGCACTAAGGTGCCGGCTGCATATGCAGCCCAAGGGTACAAAGTA 

oThrGlySerGlyLysSerThrLysValProAlaAlaTyrAlaAlaGlnGlyTyrLysVal 

130 140 150 160 170 

CTCGTCCTGAACCCGTCCGTTGCCGCCACCTTAAGTTTCGGGGCGTATATGTCCAAGGCAC 

LeuValLeuAsnProSerValAlaAlaThrLeuSerPheGlyAlaTyrMetSerLysAlaH 

190 200 210 220 230 

ATGGTGTTGACCCTAATATCAGAACTGGGACAAGGACCATCACCACGGGCGCTCCCATCAC 

isGlyValAspProAsnlleArgThrGlyThrArgThrlUThrThrGlyAlaProIleTh 

250 260 270 280 290 

GTACTCCACCTATGGCAAGTTCCTTGCAGACGGTGGTTGCTCCGGAGGCGCCTATGACATC 

rTyrSerThrTyrGlyLysPheLeuAlaAspGlyGlyCysSerGlyGlyAlaTyrAspIle 

•*10 320 330 340 350 360 

ATAATATGCGATGAGTGCCACTCAACAGACTCGACTTCCATCTTAGGCATTGGTACGGTCC 

IlelleCysAspGluCysHisSerThrAspSerThrSerlleLeuGlylleGlyThrValL 

•^70 380 390 400 410 420 

TGGACCAAGCGGAGACGGCTGGAGCGCGACTCGTCGTGCTCGCCACCGCTACGCCTCCAGG 

euAspGlnAlaGluThrAlaGlyAUArgLeuValValLeuAlaThrAlaThrProProGl 

A^?2GTCACTG^^?CACATCC^AACATCGAGGi2GTGGCCC^GTCCAACACTG^^ 
yScrValThrValProHlsProAsnneGluQluYalAlaLeuSerAsnThrGlyGluIle 

c??ttctatgg2aaagccatcc?cattgaggc?atcaaggPgggagg^ 

ProPheTyrGlyLysAlalleProIleGluAlalleLysGlyGlyArsHisLeuIUPhec 

«Kn 560 570 580 590 600 

G^CATTCTAAGAAdT^AGTGTGATGAGCTCGCCACGAAGCTGTCGGCCCT^ 
ysHisSerLysLysLysCysAspGluLeuAlaThrLysLcuSerAULeuGlyLeuAsnAi 

610 620 630 640 

tgtagcgtactaccggggtcttgatgtgtccg 

aVal AlaTyrTyrArgGlyLeuAspYalSer 
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Fig. 13 



1 10 20 30 40 50 60 

CAGGCGAGAGGCCGACAGGGATGTTTGACAGCGTAGTGCTCTGTGAGTGCTATGATGCCG 

G 1 yG 1 uArgProThrG 1 yMe t PheAspSerVa 1 Val LeuCysG 1 uCysTyrAspA 1 aG 

70 80 90 100 110 

GGGCCGCCTGGTACGAGCTTACGCCTGCTGAGACTACGGTGAGACTCCGGGCTTATTTCAA 

lyAlaAlaTrpTyrGluLeuThrProAlaGluThrThrValArgLeuArgAlaTyrPheAs 

130 140 150 160 170 

CACGCCCGGTTTGCCTGTATGTCAAGACCACCTAGAGTTCTGGGAAGCGGTCTTCACAGCT 

nThrProGlyLeuProValCysGlnAspHisLeuGluPheTrpGluAlaValPheThrCiy 

190 200 210 220 230 

CTCACACACATTGATGCCCACTTCCTCTCCCAGACGAAGCAAGGAGGAGACAACTTTGCCT 

LeuThrHisIleAspAlaHlsPheLeuSerGlnThrLysGlnGlyGlyAspAsnPheAlaT. 

9cn 260 270 280 290 

ATCTAAtGGCCTACCAGGCCACAGTATGCGCCAGGGCAAAGGCCCCCCCT^^^ 
yrLeuThrAlaTyrGlnAlaThrValCysAlaArgAlaLysAlaProPrcfProSerTrpAs 

-Jin ■ 320 330 340 350 360 

CGTGATGTGGAAGTGTCTAATCAGGCTCAAACCTACATTGACTGGTCCTACCCCCCTCC^ 

pValMe tTrpLysCysLeuI 1 eArgLeuLysProThrLeuThrGI yProThrProLeuLeu 

ta?cgcttgggtgwgtgactaacLggttacJ??gacgcaccc?gtgacgm^ 

TyrArgLeuGlyAlaValThrAsnGluValThrLeuThrHlsProValThrLysTyrlleA 

430 
CCACGT 
laThr 



67 



EP 0 468 657 A2 



Fig. 14 



1 10 20 30 40 50 60 

ATGGGCACGAATCCTAAACCTCAAAGAAAAACCAAAAGAAACACTAACCGTCGCCCACAA 
Me tG 1 yThrAsnProLysProGI nArgLysThrLysArgAsnThrAsnArgArgProGl n 

70 80 90 100 110 

GACGTTAAGTTTCCGGGCGGCGGCCAGATCGTTGGCGGAGTATACTTGTTGCCGCGCAGGG 
AspValLysPheProGlyGlyGlyGlnlleValGlyGlyValTyrLeuLeuProArgArgG 

130 140 150 160 170 

GCCCCAGATTGGGTGTGCGCGCGACAAGGAAGACTTCGAAGCGGTCCCAGCCACGTGGGGG 
lyProArgLeuGlyValArgAlaThrArgLysThrSerLysArgSerGlnProArgGlyGl 

190 200 210 220 230 

GCGCCGGCCCATCCCTAAAGATCGGCGCTCCACTGGCAAGTCCTGGGGGAAACCAGGATAC 

yArgArgProI leProLysAspArgArgSerThrGlyLysSerTrpGlyLysProGlyTyr 

250 260 270 280 290 

CCCTGGCCCCTATATGGGAATGAGGGACTCGGCTGGGCAGGGTGGCTTCTGTCCCCCCGAG 

ProTrpProLeuTyrGlyAsnGluGlyLeuGlyTrpAlaGlyTrpLeuLeuSerProArgG 

310 320 330 340 350 360 

GTTCCCGTCCCTCTTGGGGCCCCACTGACCCCCGGCATAGGTCGCGCAATGTGGGTAAGGT 
lySerArgProSerTrpGlyProThrAspProArgHisArgSerArgAsnValGlyLysVa 



CATC 
llle 
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Fig. 15 o 



1 10 20 30 40 50 60 

CGCGCAACTTGGGTAAGGTCATCGATACCCTCACATGCGGCTTCGCCGACCTCATGGGGT 

ArgAsnLeuGl yLysVal 1 1 eAspThrLeuThrCysGl yPheAl aAspLeuMe tGl yT 

70 80 90 100 110 

ACATTCCGCTTGTCGGCGCCCCCCTAGGGGGTGCTGCCAGGGCCCTGGCACATGGTGTCCG 

yrl 1 eProLeuValGl yAl aProLeuGl yGlyAl aAl aArgAl aLeuAl aHi sGl yVal Ar 

130 140 150 160 170 

GGTTCTGGAGGACGGCGTGAACTATGCAACAGGGAATTTGCCCGGTTGCTCTTTCTCTATC 

gValLeuGluAspGlyValAsnTyrAlaThrGlyAsnLeuProGlyCysSerPheSerl le 

190 200 210 220 .230 

TTCCTCTTGGCTTTGCTGTCCTGTTTGACCATCCCAGCTTCCGCTTATGAGGTGCGCAACG 

PheLeuLeuAl aLeuLeuSerCysLeuThrl 1 eProAl aSerAl aTyrGluVal ArgAsnV 

250 260 270 280 290 

TATCCGGGATATACCATGTCACGAACGACTGCTCCAACTCAAGTATTGTGTATGAGGCAGC 

alSerGlylleTyrHisValThrAsnAspCysSerAsnSerSerlleValTyrGluAlaAl 

310 320 330 340 350 360 

GGACATGATCATGCATACCCCCGGGTGCGTGCCCTGCGTTCGGGAGAACAACTCCTCCCGT 

aAspMe 1 1 1 eMe tHi sThrProGl yCysVal ProCysVa 1 ArgG 1 uAsnAsnSerSerArg 

370 380 390 400 410 420 

TGCTGGGCAGCGCTCACTCCCACGTTAGCGGCCAGGAACACCAGCGTCCCCACTACGACAA 

CysTrpA 1 aA 1 aLeuThrPr oThrLeuAl aAl aArgAsnThrSerVa 1 ProThrThrThr l 

430 440 460 460 470 480 

TACGACGGCATGTCGATTTGCTCGTTGGGGCGGCTGCTTTCTGCTCCGCTATGTACGTGGG 

leArgArgHisValAspLeuLeuValGlyAIaAlaAlaPheCysSerAlaMetTyrValGl 
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Fig. 15 b 



490 500 510 520 530 540 

GGATCTCTGTGGATCTGTCTTCCTCGTTTCCCAGCTGTTCACTTTCTCACCTCGTCGGCAT 

yAspLeuCysGlySerValPheLeuValSerGlnLeuPheThrPheSerProArgArgHis 

550 560 570 580 590 600 

GAGACAGTACAGGACTGCAACTGCTCAATCTATCCCGGCCACTTGACAGGTCATCGCATGG 

GluThrValGlnAspCysAsnCysSerlleTyrProGlyHisLeuThrGlyHisArgMetA 

10 620 630 640 650 660 

CTTGGGATATGATGATGAACTGGTCACCTACAACAGCCCTAGTGGTGTCGCATCTACTCCG 

laTrpAspMetMetMetAsnTrpSerProThrThrAlaLeuValValSerHisLeuLeuAr 

680 690 700 710 720 

GATCCCACAAGCTGTCATGGACATGGTGGCGGGGGCTCACTGGGGAGTCCTAGCGGGCCTC 

gl leProGlnAlaValMetAspMetValAlaGlyAlaHisTrpGlyValLeuAlaGlyLeu 

740 750 760 770 780 

GCCTACTATTCCATGGTGGGGAACTGGGCTAAGGTTTTGATTGTGATGCTACTCTTCGCCG 

AlaTyrTyrSerMetValGlyAsnTrpAlaLysValLeuI leValMetLeuLeuPheAlaG 

800 810 820 830 840 

GCGTTGACGGGACCACCTATGTGACAGGGGGGACGACAGGCCGCACCACCAGCTCGTTCGC 

lyValAspGlyThrThrTyrValThrGlyGlyThrThrGlyArgThrThrSerSerPheAl 

860 870 880 890 900 

ATCCCTCTTTACACTTGGGTCGCATCAGAAGGTCCAGCTTATAAATACCAATGGCAGCTGG 

aSerLeuPheThrLeuGlySerHisGlnLysValGlnLeuIleAsnThrAsnGlySerTrp 

920 930 
CACATCAACAGGACCGCC 
HisI leAsnArgThcAla 
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Fig. 16 



, 10 20 30 40 50 60 

cgccggtatSagacggcgcaagactgcaattgctcactctatcccggtcacgtatctggt 

ArgArgTyrGluThrAlaGlnAspCysAsnCysSerLeuTyrProGlyHisValSerGly 

70 80 90 100 110 

CACCGCATGGCTTGGGATATGATGATGAACTGGTCACCTACAACGGCCCTAGTGGTATCGC 

HisArgMetAlaTrpAspMetMetMetAsnTrpSerProThrThrAlaLeuValValSerG 

130 140 150 160 170 

AGCTACTCCGGATCCCACAAGCCGTCGTGGACATGGTGGCGGGGGCCCACTGGGGAGTCCT 

InLeuLeuArglleProGlnAlaValValAspMetValAlaGlyAlaHisTrpGlyValLe 

ion 200 210 220 230 

AGCGGicCTTGCCTACTATTCCATGGTGGCGAACTGGGCTAAGGTCTTGGTTGTGA^^^^ 

uAlaGlyLeuAlaTyrTyrSerMetValAlaAsnTrpAlaLysValLeuValValMetLeu 



250 260 



270 280 290 



CTCTTTGCCGGCGTTGACGACGGGAAGACCACCGTGACGGGGGGGAGCGCAGCCTTCCAGT 
LeuPheAlaGlyValAspAspGlyLysThrThrValThrGlyGlySerAlaAlaPheGlnS 



o'TA 'IftO 390 400 410 420 

CAACGGCAGCTGGCACATCAACAGGACTGCCCTGAAtTGCAATGACTCCCTCCAAACT^^ 

risnGl ySerlrpHi s 1 1 eAsnArgThr Al aLeuAsnCys AsnAspSerLeuGl nThrGl y 

Ti?ATCGCTGcJCTGTTCTACGcScACAAGT^cLTTCGTcJGSATGCCTAGAG^ 
JleneAUATaLeSheT^^^ 

coA 530 540 

J?AGCTGCCGC??CATTGACAi2TTCGCGCAGGG^^^ 

laSerCysArgProrheAspLysPheAlaGlnGlyTrpGlyProIleThrHisAspThri-r 

550 
TAAGATCCCGG 

oLys 1 1 ePro 
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Fig. 17 



1 10 20 30 40 50 60 

GACACCGTATGGCATGGGACATGATGATGAACTGGTCGCCCACGGCTACCATGATTCTGG 

HisArgMetAlaTrpAspMetMetMetAsnTrpSerProThrAlaThrMetlleLeuA 

70 80 90 100 110 

CGTATGTGATGCGCATCCCCGAGGTCGTCATGGACATCATTGGCGGGGCTCACTGGGGCGT 

laTyrValMetArglleProGluValValMetAspIlelleGlyGlyAlaHisTrpGlyVa 

130 140 150 160 170 

CATGTTCGGCTTGGGCTATTTTTCTATGCAGGGGGCTTGGGCAAAAGTCGTTGTCATCCTT 

IMetPheGlyLeuGlyTyrPheSerMetGlnGlyAlaTrpAlaLysValValVallleLeu 

190 200 210 220 230 

CTGCTGGCCGCTGGGGTGGATGCGACTACCCTCAGCGTTGGGGGCTCTGCCGCGCACACCA 

LeuLeuAlaAlaGlyValAspAlaThrThrLeuSerValGlyGlySerAlaAlaHisThrT 

250 260 270 

CCGGCGGCCTTGTCGGCTTGTTCAAGCCTGGCG 

h rG 1 yG 1 yLeuVa I G 1 yLeuPheLysProG I y 



72 



EP 0 468 657 A2 



Fig. 18 

, 10 20 30 40 50 60 

CGCTTGTCGGCGCCCCCCTAGGGGGTGCTGCCAGGGCCCTGGCACATGGTGTCCGGGTTC 

LeuValGl yAl aProLeuG 1 yGl yAl aAl aArgAl aLeuAl aHi sGl yVal ArgVa 1 L 

70 80 90 100 110 

TGGAGGACGGCGTGAACTATGCAACAGGGAATTTGCCCGGTTGCTCTTTCTCTATCTTCCT 

euGluAspGlyValAsnTyrAUThrGlyAsnLeuProGlyCysSerPheSerllePhei-e 

130 140 150 160 170 

CTTGGCTTTGCTGTCCTGTTTGACCATCCCAGCTTCCGCTTATGAGGTGCGCAACGT^ 

uLeuAl aLeuLeuSerCysLeuThr 1 1 eProAl aSerAl aTyrGl uVal ArgAsnVal Ser 

iQO 200 210 220 230 

GGGATATACCATGTCACGAACGACTGCTCCAACTCAAGTATTGTGTATGAGGCAGCGGACA 

GlylleTyrHisValThrAsnAspCysSerAsnSerSerlleValTyrGluAlaAlaAspM 

TGATyTGCATACCCCCGGGTGCGTGCCCTGCGTTCGGGAGAACAACTCCTCC^^^ 
ImeMetHlsTSproGlyCysValProCysValArgOluAsnAsnSerSerArgC^ 

no 320 330 340 350 360 

GGCAi?GCTCACTCCCACGTTAGCGGCCAGGAACACCAGCGTCCCCACTACG^ 
pAUAlaLeuThrProThrLeuAlaAlaArgAsnThrSerValProThrThrThrlleArg 

cggcStgtcgat??Sctcgttgg?Scggctgctttctgct^^^^^^ 

ArgHlsValAspLeuLeuValGlyAlaAlaAlaPheCysSerAlaMetTyrValGiyAspu 



ctattcca?Sgtggggaa??gggctaagg^tttgat^^^^^^ 

rTyrScrMetValGlyAsnTrpAlaLysValLeul leValMetLeuLeuPneRiau y 



700 710 ■'20 



740 
GACGGGACCAC 
AspGlyThr 
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Fig. 19 
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Fig. 20 
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Fig. 21 
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Fig. 22 
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Fig. 25 
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